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Chapter  1 


Introduction 

The  general  problem  to  be  considered  in  this  dissertation  is  the  rapid  deter¬ 
mination  of  the  time  that  a  particular  distributional  characteristic  or  parameter 
of  a  sequence  of  independent  multivariate  random  data  samples  changes.  This 
topic  has  been  studied  for  applications  in  typical  array  processing  fields  such  as 
radar,  sonar  and  the  processing  of  biomedical  signals,  as  well  as  other  fields  such 
as  fault  detection  in  linear  systems,  signal  segmentation  in  speech  processing,  and 
change  point  problems  in  statistics.  The  problem  is  described  as  estimating  the 
time,  P,  that  the  change  occurs  under  the  statistical  model 

Ftf  i  <  P 

Xi  ~  •  (1) 

F/c  (0ki)  i>  P 

That  is,  Xi,  the  data  for  time  sample  i,  is  independently  drawn  from  the  distri¬ 
bution  F H  (Om)  for  i  <  P  and  from  the  distribution  FK  {&Ki)  for  i  >  P  where 
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the  cumulative  distribution  functions,  FH  and  FK,  respectively  depend  on  time 
dependent  parameter  vectors  9m  and  9m- 

Page  [1]  provides  a  sequentially  based  method  for  the  quick  detection  of  the 
change  from  one  distributional  model  to  another  when  each  distribution  is  ex¬ 
actly  specified  and  stationary.  Page’s  test  may  be  best  described  as  consecutive 
applications  of  Wald’s  sequential  probability  ratio  test  (SPRT).  The  SPRT’s  are 
designed  to  decide  between  two  stationary,  known  hypotheses;  the  signal  absent 
(null)  hypothesis  and  the  signal  present  (alternative)  hypothesis.  These  assump¬ 
tions  imply  that  the  parameter  vectors  of  equation  (1)  are  known  and  constant, 
9m  =  9h  and  9Ki  =  9K.  Appropriately  designed  SPRT’s  will  eventually  termi¬ 
nate,  choosing  the  null  hypothesis  with  a  certain  design  probability,  when  there 
is  no  signal  present.  When  this  happens,  Page’s  test  restarts  the  SPRT  assuming 
that  the  signal  is  not  present  yet.  When  the  signal  does  occur,  the  current  SPRT 
or  a  following  SPRT  will  terminate  at  the  alternative  hypothesis  with  a  certain 
design  probability,  thus,  detecting  the  signal.  Performance  parameters,  differing 
from  the  Neyman-Pearson  concepts  of  detection  and  false  alarm  probabilities  (or 
Type  I  and  II  error  probabilities),  are  the  average  number  of  samples  between 
false  alarms  and  the  average  number  of  samples  before  detection.  Page’s  test 
and  Wald’s  SPRT  are  designed  for  simple  or  point  hypothesis  testing,  meaning 
that  the  statistical  distribution  of  the  data  is  known  exactly  under  both  the  null 
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and  alternative  hypotheses.  It  is  not,  in  general,  clear  how  to  proceed  when  ei¬ 
ther  or  both  distributions  are  not  completely  characterized,  yielding  composite 
hypotheses. 

One  application  of  this  problem  involves  the  detection  of  the  onset  of  a  signal 
received  at  an  array  of  sensors  in  a  background  of  interferences  and  noise.  The 
signal  may  represent  a  radar  or  sonar  target  return,  seismic  disturbance  or  the 
failure  of  a  mechanical  system.  In  frequency  domain  array  processing  applica¬ 
tions,  the  data  is  usually  modeled  as  a  complex  Gaussian  random  vector  [2]  which 
is  completely  characterized  by  its  mean  vector  and  covariance  matrix.  Before  the 
signal  arrives,  the  data  is  assumed  to  have  a  zero  mean  vector.  After  the  signal 
arrives,  under  the  deterministic  signal  model,  the  mean  vector  is  characterized  by 
an  unknown  complex  scalar  (representing  the  magnitude  and  phase  of  a  sinusoid 
arriving  at  the  array)  multiplied  by  the  array  steering  or  replica  vector.  The  array 
steering  vector  is  assumed  to  be  known  and  represents  the  relative  amplitude  and 
phase  effects  of  a  signal  being  propagated  from  a  hypothesized  position  to  the 
array  sensors.  In  both  cases,  the  covariance  matrix  of  the  data,  representing  the 
interfering  signals  and  noise,  is  unknown  and  may  be  slowly  varying.  The  sim¬ 
plest  graphical  example  of  this  shift  in  mean  scenario  is  the  univariate  or  single 
sensor  case  shown  in  figure  1  with  real  rather  than  complex  data. 

Research  related  to  the  array  signal  onset  detection  problem  has  congregated 

into  three  areas:  failure  detection  in  linear  dynamical  systems,  the  application 

\ 

of  Page’s  test  to  simplified  versions  of  the  problem,  and  sliding  block  techniques 
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Shifted  Mean 


Time  Change  Occurs 


Figure  1:  Example  of  a  data  sequence  from  a  shift  in  mean  type  distributional 
change  for  univariate  Gaussian  data  with  non-stationary  variance. 

assuming  short  term  stationarity  with  Neyman-Pearson  based  testing  for  the 
presence  of  a  signal.  Basseville  and  Nikiforov  [3]  have  recently  presented  a  com¬ 
pilation  of  many  existing  algorithms  for  change  detection,  however,  interference 
covariance  matrices  are  treated  as  identity  matrices  or  assumed  to  be  known. 
There  exists  a  substantial  amount  of  research  in  the  failure  detection  area  as 
discussed  by  Kerr  [4]  and  Basseville  and  Benveniste  [5],  with  several  sequential 
type  solutions.  A  linear  time-varying  state  model,  with  an  additional  additive 
impulsive  component  representing  the  failure  or  change  in  state,  is  typically  used 
in  this  area  as  described  by  Brumback  [6]  and  Kerr  [7].  This  is  a  particularly  ap¬ 
pealing  model  for  radar  or  sonar  applications  for  it  may  accurately  represent  the 
non-st ationarity  of  the  interference  and  noise  structure.  However,  since  complete 
knowledge  of  the  time- varying  model  parameters  is  assumed,  it  is  not  appropriate 
for  the  unknown,  slowly  changing  interference  and  noise  case.  Fortunately,  there 
exist  similarities  including  unknown  fault  levels  and  the  difficulties  involved  in 


choosing  detection  thresholds. 
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As  previously  mentioned,  Page’s  test  provides  a  method  for  detecting  the 
change  from  one  known  distribution  to  another  known  distribution.  Clearly 
the  unknown  complex  signal  amplitude  and  the  unknown  interference  covari¬ 
ance  structure,  as  well  as  its  non-stationarity,  will  cause  substantial  problems 
in  applying  this  method.  Analysis  to  date  in  this  area  includes  applying  Page’s 
test  to  univariate  (single  sensor)  cases  with  known  interference  and  unknown 
signal  characteristics.  For  this  univariate  case,  Dyson  [8],  Broder  [9],  and  Stahl 
[10]  have  considered  locally  optimal  test  statistics  to  avoid  the  unknown  signal 
strength  problem  and  have  assumed  ideal  normalization,  which  is  equivalent  to 
complete  knowledge  of  the  interference  structure.  Blostein  [11]  has  considered  the 
univariate,  ideal  normalization  case  with  a  non-stationary  signal  of  known  form 
and  unknown  amplitude.  A  Bayesian  formulation  of  the  problem  involves  apply¬ 
ing  a  prior  density  to  the  unknown  signal  strength  as  described  in  Basseville  and 
Nikiforov  [3].  It  seems  that  applying  Page’s  test  to  the  multivariate,  unknown 
interference  case  has  not  been  considered. 

In  resolving  the  interference  non-stationarity  problem,  a  short  block  of  data 
that  is  assumed  to  have  stationary  interference  is  often  analyzed.  Here  the  multi¬ 
variate  case  has  been  addressed  as  well  as  that  of  an  unknown  interference  covari¬ 
ance  structure.  Generalized  likelihood  methods  are  used  to  form  test  statistics 
that  are  compared  to  thresholds  for  deciding  signal  presence  in  the  Neyman- 
Pearson  fashion  (attempting  to  maximize  the  detection  probability  for  a  fixed 
false  alarm  rate).  The  block  is  then  shifted  along  the  data  sequence,  continually 
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testing  for  signal  presence.  Typically  a  trailing  block  that  is  assumed  to  contain 
no  signal  is  used  to  aid  in  the  estimation  of  the  interference  covariance  structure. 
A  statistical  analysis  of  this  concept  was  first  performed  by  Reed,  Mallet  and 
Brennan  [12].  A  detector  having  a  constant  false  alarm  rate  (CFAR)  property 
was  first  introduced  by  Kelly  [13],  followed  by  Robey  [14]  and  Chen  and  Reed 
[15].  The  detector  based  on  the  work  of  Reed,  Mallet  and  Brennan  [12]  does 
not  have  the  CFAR  property.  Therefore,  detection  thresholds  will  depend  on 
the  unknown  interference  covariance  structure.  The  latter  developments  [13]-[15] 
provide  methods  for  choosing  thresholds  which  makes  implementing  the  detector 
feasible.  The  CFAR  property  is  the  result  of  forming  test  statistics  that  have 
probability  density  functions  (PDF’s)  that  do  not  depend  on  the  interference  co- 
variance  structure  when  no  signal  is  present.  Unfortunately,  the  PDF’s  of  these 
test  statistics,  and  thus  detector  thresholds,  are  often  very  difficult  to  derive  and 
evaluate. 

In  summarizing  relevant  existing  research  it  is  found  that  sequential  tech¬ 
niques  utilizing  Page’s  test  have  been  developed  for  known,  stationary  interfer¬ 
ence  covariance  structures  in  the  univariate  case  allowing  an  unknown  signal 
amplitude  and  phase.  Those  methods  that  deal  with  the  multivariate,  unknown 
interference  covariance  are  based  around  non-sequential,  Neyman-Pearson  test¬ 
ing  using  statistics  formed  from  short  blocks  of  data  over  which  the  interference 


is  assumed  to  be  stationary. 
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This  research  proffers  a  technique  that  involves  segmenting  the  incoming  data 
sequence  into  non-overlapping  blocks  of  equal  size  small  enough  that  the  data 
within  each  block  may  be  assumed  to  be  stationary.  Prom  each  block,  a  uni¬ 
variate  statistic  is  formed  in  such  a  fashion  that  its  probability  density  function 
does  not  depend  on  the  interference  covariance  matrix  when  no  signal  is  present. 
This  property  is  known  as  ancillarity  and  insures  that  any  detector  based  on  this 
statistic  will  have  a  constant  false  alarm  rate.  The  ancillarity  of  the  block  level 
statistic  to  the  interference  covariance  structure  clearly  solves  the  unknown  inter¬ 
ference  covariance  problem  at  the  expense  of  time  resolution.  The  ancillarity  also 
deals  with  the  non-stationarity  of  the  interference  from  block  to  block.  Indirect 
dependence  on  the  interference  structure  is  allowed,  when  a  signal  is  present,  in 
the  form  of  a  scalar  quantity,  the  signal-to-interference  power  ratio  (SIR).  There 
exists  no  optimal  method  for  forming  the  block  level  statistic,  so  two  approaches 
are  considered  that  utilize  generalized  likelihood  ratio  techniques. 

Prom  a  microscopic  point  of  view,  the  problem  has  now  become  a  simple  versus 
composite  hypothesis  test,  in  that  the  distribution  of  the  block  level  statistic  is 
known  explicitly  when  no  signal  is  present  and  only  depends  on  the  scalar  SIR 
otherwise.  This  situation  now  resembles  what  Dyson  [8]  considered  and  is  treated 
by  using  a  locally  optimal  detector  non-linearity.  An  alternative  approach  is  to 
form  a  test  based  on  a  design  signal-to-interference  ratio  in  the  hopes  that  the 
test  will  work  well  for  a  wider  range  of  SIR.  These  two  methods  provide  detector 
functions  or  non-linearities  that  are  applied  to  the  block  level  statistic  prior  to 
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submission  to  Page’s  test.  Page’s  test  will,  with  some  delay,  determine  when 
the  signal  has  occurred  at  the  block  level.  Post  block  processing,  in  the  form 
of  a  maximum  likelihood  estimate,  provides  higher  resolution  estimation  of  the 
onset  time.  A  block  diagram  describing  the  proposed  algorithm  can  be  found  in 


figure  2. 


Data 


/ 


Page’s  Test 
Statistic 


Form  Block  Level  Statistics 


Block  Level  Page’s  Test 


Post  Block  Processing: 

•  MLE  for  unknown  start 
time  on  data  prior  to  block 
detection. 


Apply  Detector  Non-linearity 


Figure  2:  Block  diagram  of  proposed  algorithm. 


The  basic  problem  addressed  in  this  dissertation  is  the  application  of  Page’s 
test  to  the  multivariate  unknown,  slowly  varying  interference  covariance  matrix 
and  unknown  complex  signal  amplitude  problem.  Although  this  is  a  specific 
problem,  the  solution  is  applicable  to  a  particular  subset  of  the  general  problem 
as  stated  in  equation  (1):  those  problems  that  have  common  unknown  (nuisance) 
parameters  before  and  after  the  change  takes  place,  subject  to  the  existence  of 
block  statistics  that  are  ancillary  for  the  nuisance  parameters  under  the  null 
hypothesis  and  depend  on  at  most  an  unknown  scalar  strength  parameter  under 


the  alternative  hypothesis. 
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This  dissertation  is  organized  as  follows:  Background  information  on  Page’s 
test,  performance  measures,  and  the  choice  of  the  detector  non-linearity  are  dis¬ 
cussed  in  chapter  2.  Chapter  3  contains  two  approaches  for  forming  the  block 
level  statistic,  the  derivation  of  each  statistic,  and  a  statistical  description  in  the 
form  of  conditional  probability  density  functions.  The  locally  optimal  detector 
non-linearity  derivation  and  optimal  methods  of  choosing  a  bias  for  Page’s  test 
follow  in  chapter  4.  The  performance  of  the  detectors  is  explored  for  varying 
SIR,  number  of  sensors,  and  block  size  in  chapter  5.  The  post  block  processing 
algorithm  is  derived  and  evaluated  in  chapter  6.  A  simulation,  coupling  the  block 
detection  utilizing  Page’s  test  to  the  post  block  processing,  for  a  stationary  and 
a  non-stationary  unknown  interference  scenario  is  found  in  chapter  7,  followed 
by  concluding  remarks  and  recommendations  in  chapter  8.  Several  theorems  are 
used  throughout  this  dissertation  with  their  statements  and  proofs  found  in  ap¬ 
pendix  A.  A  method  for  approximating  the  non-central  Fisher’s  /  probability 
density  function  is  described  in  appendix  B.  Appendix  C  contains  a  summary  of 
the  probability  density  functions  and  selected  properties  of  the  random  variables 


used  in  this  dissertation. 


Chapter  2 


Page’s  Test  Background 


The  detector  structure  proposed  in  this  dissertation  utilizes  Page’s  test  [1]  to 
determine  the  time  that  a  distributional  parameter  of  a  sequence  of  random  data 
samples  changes  from  following  a  null  hypothesis  (H)  to  following  an  alternative 
hypothesis  (. K ).  Page’s  test  uses,  as  a  detector  statistic,  the  difference  between 
the  current  value  of  a  cumulative  sum  statistic  and  the  minimum  value  over  all 
past  sums.  The  cumulative  sum  statistic  is  the  sum  of  a  detector  non-linearity, 
g,  applied  to  the  data,  Xx,  at  time  sample  i, 

S(  =  £p(X6).  (2) 

k= 1 

Page’s  test  compares  the  detector  statistic, 

Ti  =  Si  -  min  Sj,  (3) 
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to  a  threshold,  h, 

K 

Ti  >  h,  (4) 

H 

to  test  for  the  alternative  hypothesis.  The  detector  statistic  may  alternatively  be 
written  in  a  recursive  form, 

Ti  =  max  {0,Tj_i  +g  (Xj)} .  (5) 

The  expected  value  of  the  detector  non-linearity  is  required  to  be  negatively 
biased  under  the  null  hypothesis  and  positively  biased  under  the  alternative, 

E  [9\0h]  <  0  <  E  [g\0K\ ,  (6) 

where  E  [g\0n\  and  E  [g\0K]  respectively  represent  the  expected  value  of  the  output 
of  the  detector  non-linearity  under  the  null  and  alternative  hypotheses. 

2.1  Performance  Measures 

The  performance  of  Page’s  test,  when  applied  to  signal  detection,  is  defined  by 
the  average  number  of  samples  between  false  detections  when  no  signal  is  present, 
T,  and  the  worst  case  average  number  of  samples  required  before  detection  when 
a  signal  is  present,  D.  The  worst  case  condition  is  taken  over  all  possible  starting 
times  for  the  signal  and  occurs  whenever  the  statistic  of  equation  (3)  or  (5) 
returns  a  value  of  zero.  Broder  [9]  has  shown  that  for  large  values  of  T,  there 
is  an  exponential  relationship  between  the  average  time  between  false  alarms 
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and  the  average  delay  before  detection.  Prom  this  relationship  an  asymptotic 
performance  measure  is  formed, 


77  =  lim 


logT 


xxxxx  __  « 

h— >oo  D 


(7) 


where  the  natural  logarithm  is  used  and  h  is  the  threshold  used  in  Page’s  test, 
which  is  monotonically  increasing  in  T.  As  shown  by  Broder  [9]  and  Willett 
[16],  approximations  to  T  and  D  may  be  found  by  using  standard  techniques  of 
sequential  analysis,  resulting  in 


1  +  ht  ($H)  -  eht 
t  (6h)  E  [g\On\ 

_Qht{6H) 

t  (Oh)  E  [g\On\ 


(8) 

(9) 


and 


E  [g\0Ky 

where  t  ( 0H )  is  the  non-zero,  unity  root  of  the  moment  generating  function  of  the 
detector  non-linearity  under  the  null  hypothesis, 


E  [eWs| 9h\  =  1.  (11) 

The  approximations  in  (9)  and  (10)  are  valid  for  large  values  of  the  detector 
threshold,  h,  and  are  also,  as  seen  in  Broder  [9],  respectively  lower  and  upper 
bounds.  Applying  these  approximations  to  the  asymptotic  performance  measure 
results  in  the  approximation 

-  _  lim  E  (g\0K\  [ht  (0H)  -  log  (-t  (0H)  E  [g\0H])\ 

h— >oo  h 
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=  t(6H)E\g\dK\ 

=  V- 


(12) 


As  mentioned  by  Broder  [9],  the  approximation,  r),  is  also  a  lower  bound  on 
the  asymptotic  performance  measure  of  equation  (7).  In  this  dissertation,  the 
approximation  to  Broder’s  asymptotic  performance  measure,  T),  will  be  termed 
the  asymptotic  efficacy. 

From  equation  (9)  the  threshold  required  to  implement  Page’s  test,  for  a 
specified  average  time  between  false  alarms,  may  be  approximated  by 


log[-t  (6H)E\g\9H]T\ 
t  (Oh) 

logT  4-  log  [~t  (Oh)  E  jgjflg]] 
t  (Oh) 


(13) 


2.2  Detector  Non-Linearities 


The  detector  non-linearity  may  be  optimally  chosen  so  that  for  a  lower 
bounded  average  time  between  false  alarms,  T  >  T0,  the  worst  case  average 
delay  to  detection  is  minimized.  Lorden  [17]  proved  this  result  in  an  asymptotic 
sense  as  T0  — >  oo  for  a  log-likelihood  ratio  detector  non-linearity, 


l  (x)  =  log 


fx  (x\dK) 


[fx(x\0H)\ 


(14) 


where  fx  (x\ 0H)  and  fx  (x\0K)  are  respectively  the  probability  density  functions 
of  the  data  under  the  null  and  alternative  hypotheses.  Moustakides  [18]  extended 


the  result  to  the  non-asymptotic  case. 
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The  locally  optimal  non-linearity  for  a  scalar  signal  strength  parameter,  9, 
was  considered  by  Dyson  [8], 


g(x) 


[&fxm\e=eH 

fx  (*| Oh) 


(15) 


and  shown  to  be  optimal  for  9  near  9jj.  Dyson  actually  used  a  first  order  Taylor 
series  expansion  of  the  log-likelihood  ratio  about  9h,  including  a  bias  term  on  the 
order  of  ( 9  —  9h)2,  which  results  in  a  detector  non-linearity  that  is  still  a  function 
of  the  signal  strength.  Under  the  locally  optimal  or  weak  signal  assumption,  the 
bias  term  may  be  considered  insignificant,  and,  if  the  signal  strength  is  assumed 
to  be  constant,  the  result  is  a  scale  of  equation  (15).  In  order  to  apply  (15)  to 
Page’s  test,  as  seen  in  equation  (6),  a  bias  must  be  applied  so  that  the  mean 
of  the  detector  non-linearity  is  negative  when  no  signal  is  present.  As  seen  in 
section  4.3,  the  bias  may  be  chosen  so  as  to  minimize  the  signal  strength  required 
to  achieve  a  desired  asymptotic  efficacy  or  to  equivalently  maximize  the  efficacy 


for  a  given  signal  strength. 


Chapter  3 


Block  Level  Statistics 


The  proposed  solution  to  the  data  stationarity  and  unknown  interference  pa¬ 
rameter  problems  involves  segmenting  the  time  series  data  into  blocks  of  equal 
length.  This  allows  each  block  of  data  to  be  compressed  into  a  single  univariate 
statistic  having  the  predominant  property  of  ancillarity  with  respect  to  the  un¬ 
known  interference  parameters.  A  statistic  formed  from  the  random  variable  X 
with  distribution  fx  (x\ 9)  where  9  is  a  parameter, 

T  =  g  (X) ,  (16) 

is  ancillary  for  6  if  its  distribution,  fT  (t),  does  not  depend  on  6  [19].  The  ancillar¬ 
ity  avoids  the  stationarity  problem  in  that,  when  no  signal  is  present,  a  change  in 
interference  structure  from  one  block  to  the  next  does  not  affect  the  distribution 
of  the  block  level  statistic. 
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In  addition  to  the  ancillarity  of  the  block  level  test  statistic  to  the  interfer¬ 
ence  parameters,  it  is  desired  that  the  statistic  provide  adequate  detection  per¬ 
formance  when  applied  to  Page’s  test  for  a  wide  range  of  signal  powers.  Ideally, 
the  formation  of  the  univariate  test  statistic  from  the  block  of  multivariate  data 
would  be  performed  by  optimizing  some  detection  criteria  related  to  Page’s  test, 
for  instance,  the  asymptotic  efficacy.  Unfortunately,  there  is  no  known  method 
for  optimally  dealing  with  the  unknown  interference  parameters.  The  strategy 
adopted  herein  is  to  break  the  problem  into  two  operations:  first,  compress  the 
block  level  data  into  a  univariate  statistic  that  is  ancillary  for  the  unknown  inter¬ 
ference  parameters,  allowing  dependence  only  through  a  signal  related  parameter, 
the  signal-to-interference  ratio  (SIR);  next,  apply  a  non-linearity  to  the  univari¬ 
ate  statistic  that,  in  some  sense,  optimizes  the  detection  performance  of  Page’s 
test. 

In  choosing  a  method  of  compressing  the  block  data  into  a  univariate  statistic, 
the  primary  requirement  is  ancillarity  with  respect  to  the  unknown  interference 
parameters.  Statistics  that  are  invariant  under  a  linear  transformation  of  the 
array  data  under  the  null  hypothesis  provide  ancillarity  for  the  interference  pa¬ 
rameters  as  well  as  the  potential  for  a  maximally  invariant  statistic.  Scharf  [20] 
indicates  that  only  considering  decision  rules  that  are  invariant  to  such  transfor¬ 
mations  is  appropriate.  The  function  that  forms  the  univariate  statistic  from  the 
block  data  is  invariant  under  a  linear  transformation  if  the  function  has  the  same 
value  for  all  linear  transformations  of  original  data  [21]  and  [22].  The  existence 
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of  a  maximally  invariant  statistic  is  not  guaranteed  and  is,  in  general,  difficult  to 
test  for.  As  an  alternative  to  data  compression  under  the  premise  of  invariance, 
consider  hypothesis  testing  in  the  presence  of  unknown  parameters;  the  subop- 
timal,  although  usually  very  good,  generalized  likelihood  ratio  test  (GLRT)  is 
often  used.  This  method  of  compressing  block  data  into  a  univariate  statistic 
may  yield  a  statistic  that  is  ancillary  for  the  unknown  interference  parameters 
and  is  potentially  good  for  detection  in  the  Neyman-Pearson  sense1  . 

When  estimators  of  the  unknown  interference  parameters  exist  in  the  same 
form  when  signal  is  present  and  absent,  a  block  level  statistic  may  be  formed 
by  substituting  the  estimates  into  a  likelihood  ratio  test  (LRT)  or  a  GLRT  for 
the  unknown  signal  parameters  where  the  interference  parameters  are  assumed 
to  be  known.  This  method  has  been  applied  to  adaptive  array  signal  detection 
by  Robey  [14]  and  Chen  and  Reed  [15],  with  the  result  termed  an  adaptive 
matched  filter  (AMF).  In  the  following  sections,  the  GLRT  and  the  method 
involving  the  substitution  of  interference  parameter  estimates  will  be  investigated 
for  a  deterministic  signal  with  unknown  complex  amplitude.  In  particular,  the 
function  defining  the  compression  of  the  block  data  will  be  derived  and  the  block 
level  statistic  will  be  described  statistically.  The  following  sections  apply  these 
two  methods  of  data  compression  to  the  unknown  complex  signal  amplitude, 

unknown  interference  covariance  matrix  problem. 

1The  GLRT  maximizes  an  upper  bound  on  the  best  case  detection  probability  while  also 
bounding  the  worst  case  false  alarm  probability. 
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3.1  GLR  Statistic 

3.1.1  Derivation 

The  deterministic  signal  model  assumes  that  the  frequency  domain  array  data 
is  statistically  represented  by  an  iV-variate  complex  Normal  distribution, 

Xj  ~  CJ\fN  (9d,  E) ,  (17) 

for  i  =  1, . . . ,  M,  where  M  is  the  block  size,  9  is  the  complex  signal  strength,  d 
is  a  known  mean  vector,  and  E  is  the  interference  covariance  matrix.  The  nota¬ 
tion  x  ~  CJ\f  n  (i u,  E)  indicates  that  the  JV-by-1  random  vector  x  has  a  complex 
Normal  distribution  with  mean  /i  and  covariance  matrix  E.  Throughout  this  dis¬ 
sertation,  bold  lower  case  letters  indicate  vectors,  bold  uppercase  letters  indicate 
matrices,  and  the  superscripts  T  and  H  respectively  represent  the  transpose  and 
complex  conjugate  and  transpose  operations. 

The  generalized  likelihood  ratio  (GLR)  statistic  for  the  deterministic  signal 
model  has  the  following  form, 

max  /  (X|0,  E) 

A  _  eee, Sen  v 

GLR~  max  /  (X|E) 

Sen  v 

where 

©  =  C1  (19) 

and 

n  -  {E  g  CNxN  :  E  >  0,  E  =  Eh}  (20) 
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where  C  is  the  complex  plane  and  X  >  0  indicates  that  the  matrix  S  is  positive 
definite.  The  likelihood  function  of  the  data  in  the  block  is 

M  1 

/(X |8,S)  =  s 

r  i  /  i  m  \iM 

=  ^jetr(-s_1MBx-0d)(^-^d)HJ |  .  (21) 

as  found  in  Goodman  [23].  The  function  etr  represents  an  exponential  trace 
operator, 

etr(A)=etrace<A>,  (22) 

and  the  jV-by-M  matrix  X  is  the  combined  data  of  the  block, 

X  =  [xi  x2  •  •  •  xM] .  (23) 


The  GLR  statistic  for  this  problem  may  be  found  by  a  development  similar 
to  that  of  Kelly  [13].  Kelly  considered  testing  one  data  vector  for  the  presence  of 
a  signal  when  independent  secondary  data  with  the  same  interference  covariance 
structure  was  available  to  assist  in  estimation.  The  problem  at  hand  results  in  a 
similar  sufficient  statistic  probability  density  function.  Proceeding,  set 


1  M 

B  (9)  =  i  £(*  -  fldX*  -  ed)H,  (24) 

i=l 

and  substitute  the  deterministic  signal  model  probability  density  function,  (21), 
into  the  numerator  of  (18), 


max  /  (X|0,  X) 
0ee,£en  v 


max 


(25) 
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The  maximization  with  respect  to  E  is  identical  to  that  for  determining  the 


maximum  likelihood  estimate  of  an  unknown  covariance  matrix  for  a  Gaussian 
random  vector  with  a  zero  mean.  The  specific  result  may  be  shown  by  equating 
the  derivative,  with  respect  to  E,  of  the  quantity  inside  the  brackets  in  equation 
(25)  to  zero, 

d  etr  (— E_1B  (6))  etr  [-E^B  (0)] 


as 


7T 


N  IE! 


irN  |E| 


E_1B  (0)E-1  -  E 


etr[  E  lB  Wls-i  [B  ^  (26) 


7 r 


where  the  following  matrix  derivatives,  as  found  in  Scharf  [20],  are  utilized 


8 

dE 


|S|  =  |E|  (E-1) 


H 


(27) 


and 


^tr  (E-‘A)  -  (-E-'AE-1)" .  (28) 

Clearly,  the  derivative  of  equation  (26)  equals  zero  when  E  is  chosen  to  be 


E  =  B(0). 


(29) 


This  in  turn  causes  the  numerator  of  the  GLR  statistic  to  become 

r  i  lM 

_ f  (XI 6.  E^  =  max  I  —  —  ,  ^ ,  e~N 

ee& 


6e@ 


nN  |B  (0)| 


(30) 


The  maximization  over  6  may  be  accomplished  by  minimizing  the  determinant 
of  B  (6).  Proceeding,  B  (0)  is  written  in  terms  of  the  sample  mean  and  sample 


covariance  matrix, 


M 


b(0)  =  Y,  Kx*  -  x)  +  (x  - 0d)]  t(x*  -  x)  +  (x  -  0d)]H 

2— 1 


S  +  (x  —  #d)(x  —  9d)H, 


(31) 
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where 


1  M 


X 


(32) 


and 

s  =  i£(*i -*)(*-*)"  <33) 

i=  1 

As  seen  in  Muirhead  [24],  the  sample  covariance  matrix  of  equation  (33),  which  is 
clearly  hermitian,  is  positive  definite  with  probability  one  if  and  only  if  M  >  N. 
Thus,  a  square  root  factorization  may  be  applied  where  the  square  root  matrix 
is  also  positive  definite  and  invertible, 


S  =  TTh  (34) 

r  >  o. 


The  determinant  of  B  (9)  may  now  be  factored  into 


|b(0)|  =  |r[iJV  +  r-1(x-M)(x-0d)H(rH)-1 
=  \rrH\  [i  +  (x  -  ^d)H(rH)-1r-1(x  -  0d) 
=  |S|  [l  +  (x  —  9d)HS~1(x  —  0d)] , 


(35) 


where  lN  is  the  N  dimensional  identity  matrix.  By  completing  the  complex 
square  in  9, 


]B  (0)|  =  |S|  [l  +  x^S-'x  -  e*dHS~lx  -  e^S-'d  +  ^*dHS-1d 

dHS_1: 


1  +  xHS_1x  — 


dIiS~1x 


dHS~1d 


+  dHS~1d 


9 


dHS~'d 


(36) 
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where  the  superscript  *  is  the  conjugation  operator.  Since  S  is  positive  definite, 
dHS-1d  is  greater  than  zero.  Thus,  |B  (<9)|  is  minimized  by  choosing 


e  = 


d^S-1x 

dHS-1d' 


(37) 


Under  the  maximization  with  respect  to  9,  the  numerator  of  the  GLR  statistic 


becomes 


/(X \e,t)  = 


(ire)~UN  |Sf" 


T  M  ‘ 


1  1  xH’S~1x  ldHS'^l2 

1  ^ x  °  x  dHS_1d 

The  denominator  of  the  GLR  statistic  is  exactly  equation  (30)  with  9  —  0, 

1 


(38) 


max  /  (X|S)  = 
Sen  v  ; 


„-jv 


M 


_ttn  |B(0)| 

Setting  9  =  0  in  equation  (35)  and  substituting  |B(0)|  into  (39)  yields 

(ve)-MN  |S|-M 


(39) 


/(X|S)  = 


[1  +  xffS-1x]M 


(40) 


By  combining  (38)  and  (40),  the  resulting  GLR  statistic  is 

-i  M 

A  l+x^S^x 

A GLR  __  i  jh  c-i-v-i2 


1  +  xHS-1x-  |dHS~ljt 
i  -r  x  o  x  d-HS_1d 


(41) 


3.1.2  Conditional  Distribution 


The  Mth  root  of  the  GLR  statistic  may  equivalently  be  used  in  evaluating  the 
conditional  distribution,  as  it  is  a  one-to-one  transformation  from  A glr, 

T  =  (Aglr)m  ■  (42) 

The  first  step  in  statistically  describing  T  for  analysis  is  to  form  whitened  data. 
This  is  accomplished  by  a  full  rank  linear  transformation  of  the  block  level  data 
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so  that  the  covariance  matrix  of  the  resulting  transformed  variables  is  the  N 
dimensional  identity  matrix.  The  linear  transformation  is  described  by 

y  i  =  A Xi,  (43) 

for  i  =  1, . . . ,  M.  Because  the  transformation  is  full  rank,  it  may  additionally  be 
defined  to  rotate  the  mean  vector  to  a  scaled  unit  vector  with  weight  only  in  the 
first  element, 

r  iT 

ei  =  l  o  •  •  •  o  t44) 

This  is  accomplished  by  choosing  the  transformation  matrix  as 

A  =  PT"1,  (45) 

where  T  is  a  square  root  of  the  data  covariance  matrix,  S, 

s  =  rr^,  (46) 

and  P  is  an  orthogonal  matrix  chosen  so  that 

A0d  =  0Pr-xd  =  /tei,  (47) 


where  (3  satisfies 

\p\2  =  \0\2dH?;-1d  =  s,  (48) 

where  s  is  defined  as  the  adaptive  signal-to-interference  ratio.  This  definition  of 
the  SIR  is  from  the  beam  output  of  a  minimum  variance  distortionless  response 


24 


adaptive  beamformer  when  the  interference  structure  is  known.  The  whitened 
block  data  vectors  are  now  distributed  as 


Yi  ~  CM N  (/?e;i,Ijv)  ■ 


(49) 


The  sample  mean  of  the  whitened  data, 


M 


y 


i= 1 


=  Ax, 


(50) 


is  distributed  as 


y  ~  CMn  ( /5ei,  —  In  )  • 


(51) 


The  sample  covariance  matrix  of  the  whitened  data  has  the  form 


u 


c  =  igE(y i-y)(y<-y) 

m  i= 1 


H 


=  A 

=  asah. 


1 


M 


•jj  £(*-*)  (*-*)* 

i=l 


(52) 


(53) 


As  seen  in  Muirhead  [24]  for  real  Normal  random  vectors,  the  sample  mean 
vector  and  the  sample  covariance  matrix  of  complex  Normal  random  vectors  are 
independent  and  the  sample  covariance  matrix  is  the  scale  of  a  complex  Wishart 
distributed  matrix, 

MC  ~  CWN  (M  -  1,  IN) .  (54) 

The  notation 


X  ~  CWN  (L,  S) 


(55) 
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indicates  that  the  iV-dimensional  random  matrix  X  has  a  complex  Wishart  dis¬ 
tribution  with  L  degrees  of  freedom  and  scale  matrix  X. 

Applying  this  whitening  transformation  to  equation  (41),  the  Mth  root  of  the 
GLR  statistic  becomes 

T  = _ 1  ±  fHC~l - T.  (56) 

1  |eH C-1V  2  V  ' 

i  +  yHc-^y-^4r 

Choosing  the  transformation  to  rotate  all  the  weight  of  the  mean  vector  into  the 
first  element  of  the  transformed  data  vectors  separates  the  data  into  a  signal  sub¬ 
set  and  a  non-signal  subset.  The  signal  subset  is  comprised  of  the  first  element  of 
each  transformed  data  vector,  while  the  non-signal  subset  contains  the  remaining 


data, 


where  the  subscript  a  represents  the  signal  data  and  the  subscript  {,  represents 
the  non-signal  data. 

The  transformed  data  may  be  grouped,  column-wise,  into  the  N-bj-M  matrix 


Yi  Y2  •  •  •  Ym  ■ 


Applying  the  signal-non-signal  segmentation  to  this  matrix  yields 


Y  = 


Val  Ua 2  •  '  '  VaM 


y&i  y&2  •  •  •  YbM 


(59) 
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where  signal  data  is  grouped  into  an  M- by-1  vector, 

-tT 

~  Pal  Pa2  •  •  •  PaM 
and  non-signal  data  is  grouped  into  an  N  —  1-by -M  matrix, 


(60) 


Yi,  = 


ybi  y&2  •  •  •  y  bM 


(61) 


Applying  this  segmentation  to  the  transformed  data  sample  mean  vector  and 
sample  covariance  matrix  yields  respectively, 


and 


C  = 


Pa 

y  b 


r  rH 

c aa  ba 

^ba  Cfrft 


(62) 


(63) 


As  seen  in  (56),  the  inverse  of  the  sample  covariance  matrix  is  required,  and  may 
be  segmented  as  follows: 


C"1  =  D  = 


daa  djQ 
dfca 


(64) 


The  relationships  between  the  partitioned  sample  covariance  matrix  and  its  in¬ 
verse  are  easily  derived  and  may  be  found,  for  example,  in  Muirhead  [24], 


da}  =  caa  -  clCJcba 


*1  Cfa 


(65) 

(66) 


=  D»  -  (ijdtedg. 


(67) 
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Applying  the  signal-non-signal  segmentation  to  the  quadratic  and  bilinear  forms 
of  equation  (56)  results  in, 


efC~1e1  =  daa  (68) 

efc_1y  =  daaVa  +  d^fb  (69) 

yHC_1y  =  daa  \Va\ 2  +  Vafb  d6a  +  V*a^baYb  +  Yb^bbfb-  (70) 


Completing  the  complex  square  for  ya  in  (70)  yields  the  form 


y^C  1y 


(71) 


Substituting  these  forms  into  (56)  results  in 


T  = 


1  +  y bCbbYb  +  daa  ya  +  djdgyh 


1  +  y  bCbbfb 


(72) 


Subtracting  one  from  each  side  and  scaling  the  resulting  numerator  and  denomi¬ 
nator  by  2 M  yields  the  form 


W  =  T-  1  = 


2 M  (l  +  yf  C bbyb)  1 1 ya  +  dj  d£ayb 


2  Mdg 


(73) 


Conditioned  on  observed  values  of  the  non-signal  data,  the  numerator  and 
denominator  of  W  are  independent  Chi-squared  random  variables.  The  numer¬ 
ator  is  non-central  with  two  degrees  of  freedom  and  the  denominator  is  central 
with  2 (M  -  N)  degrees  of  freedom.  This  will  be  shown  by  forming  the  numerator 
and  denominator  of  W  as  quadratic  forms  with  the  vector  y*.  Under  certain 
conditions,  the  quadratic  form  involving  either  real  or  complex  Gaussian  ran¬ 
dom  vectors  has  a  non-central  Chi-squared  distribution.  The  results  for  the  real 
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case  are  commonly  known  and  may  be  found  in  Searle  [25].  The  results  for  the 
complex  case,  found  in  Theorem  A.l  of  appendix  A,  are  similar. 

In  order  to  express  the  numerator  and  denominator  of  W  as  quadratic  forms, 
the  sample  covariance  matrix,  C,  must  be  described  in  terms  of  the  signal  and 
non-signal  data.  Expanding  the  sum  of  equation  (52)  results  in  the  form 

1  M 

c  = 

i—1 

=  ^YY"  -  yyH.  (74) 


Applying  the  signal-non-signal  segmentation  yields 


C  = 


1 

yjy  a 

yjvf 

VaVa 

yayb 

M 

yby ; 

YbYf 

y*a?b 

ybfb 

(75) 


Noting  that  the  segmented  sample  mean  vector  of  the  transformed  data  may  be 
expressed  as  linear  combinations  of  the  total  data, 


ya 

iriTya  =  &  y^i 

(76) 

yb  = 

■kYbi  > 

(77) 

where  1  is  a  column  vector  of  ones,  the  elements  of  the  sample  covariance  matrix 
may  be  expressed  as 


— yTv* - yTllTy* 


1  T  fj _ 

MYq  \  M  M 


11T )  y: 


=  Jjyl?  mY*a, 


(78) 
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cta  =  jgY>y:  -  iplTK 
=  s(Y‘-y*lT)y« 

-  sY‘(I"-^nr)y: 

=  -^YtP„y;, 

c»  =  ^Y>Y?  -  WY^Y“ 


~  SY*('«-5lirlY* 

_  J_v  p  V-^ 

-  M  fc  m  6’ 


H 


(79) 


(80) 


where  Pm  is  a  dimension  M  projection  matrix  with  form 

P™  =  I illT. 


M' 


(81) 


Defining 


Kb  =  (l  +y?C661y&) 


M1 


I*  +  — Yf  C^Y6 


1, 


(82) 


and  applying  equations  (65)-(67)  and  (76)-(80),  the  numerator  of  IP  becomes, 


=  2  M/Cft1 

Va  +  4adtoy& 

=  2  Mk^1 

-  c^C^yt 

=  2  Mk^1 

1  Ji 

My“ 

1  Pm 

=  2 
=  2 


y/MKb 

rp  |2 

yap& 


T 

ya 


b  ^bb  > 

I m  -  ip„YfCs‘Y6 


=  2yJ  (pi,pf)  y; 


(83) 
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where 

p‘  =  7fe[lM-5PmYfc“Y‘]1'  (84) 

If  WN  is  to  be  a  Chi-squared  random  variable,  as  seen  in  Theorem  A.l,  the 
product  of  the  matrix  in  the  quadratic  form  (i.e.,  p&pf)  and  the  covariance 
matrix  of  the  Gaussian  random  vector  must  be  idempotent.  The  vector  ya  is 
formed  from  the  first  elements  of  the  transformed  data  vectors,  y*,  which  are 
distributed  as  described  in  (49).  Thus,  the  vector  y*  is  distributed  as 

y*  ~  CM m  (/3*1, 1 m)  ■  (85) 

Since  the  covariance  matrix  of  this  Gaussian  random  vector  is  the  identity  matrix, 
if  the  matrices  in  the  quadratic  forms  of  the  numerator  and  denominator  of  W 
are  idempotent,  the  quadratic  forms  are  Chi-squared  distributed.  Being  the  outer 
product  of  a  vector  with  its  conjugate  transpose,  the  numerator  quadratic  form 
matrix  has  rank  equal  to  one  and  is  idempotent  if  the  magnitude  of  the  vector  is 
one.  Proceeding,  the  inner  product  of  the  vector,  p&,  with  its  conjugate  transpose 
is  shown  to  be  equal  to  one, 

P?P»  =  j^lT  [l«  -  iYfc^YtPm]  [l«  -  ^pmYfCi‘Y6]  1 

=  3^lT[I"  +  S5Y‘ffc“Y‘p'"Y"c“Y'']1 
-  3SSiT[i“  +  Sy'c=1y*]1 


i. 


(86) 
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This  simplification  requires  noting  that  the  vector,  1,  is  in  the  null  space  of  the 
projection  matrix  Pm, 

Pml  =  (lTPm)T  =  0  ,  (87) 

and  that,  since  Pm  is  a  projection  matrix, 

P  2m  =  Pm-  (88) 

As  seen  in  Theorem  A.l  the  non-centrality  parameter  of  WN  is  a  function  of  the 
mean  vector  of  y*  and  the  matrix  in  the  quadratic  form  of  equation  (83), 

6n  =  2  \(3\2  lTpbPb  1 
=  2|/31Tpfc 

=  ik 

=  2^,M!-9|2 

=  2  Mps,  (89) 

where  p  =  k^1.  Thus,  conditioned  on  the  non-signal  data,  Wjv  is  distributed  as 

WN  ~  Xl  (6n)  ■  (90) 


The  denominator  of  W  is  now  placed  into  a  quadratic  form  with  the  vector 


y 


* 

a’ 


=  2  Md~l 


2 M  (caa  Cf,QCj,j  Cf,aj 
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=  2yJ  [pm  -  ip„Yf C^YjP™]  y : 

=  2yJ  (pm  [lM  -  ^Yf  C5%]  Pm)  y; 

=  2yfPty;,  (91) 

where 

Pi  =  Pm  [lM  -  j^Yf  C^'Yj]  Pm.  (92) 

The  matrix  in  the  denominator  quadratic  form  is  shown  to  be  idempotent  by 
equating  the  matrix  to  its  square, 

P62  =  P„  [l«  -  Ifif  C£*Y6]  P.  [i*  -  jfl? CS‘Yt]  P„ 

=  Pm-  ^PmYfC^YbPm  +  ipmYf  eg1  (iYiP„Yf)  C^YiPm 
=  P»  [l«  -  CtfYj]  Pm 

=  Pi-  (93) 

The  non-centrality  parameter  of  Wd  is 
SD  =  2|/3|2lTPil 

=  2  \0f  lrPm  [l„  -  -^Yf  C^Yi]  Pml 
=  0,  (94) 

thus  resulting  in  a  central  Chi-squared  distribution.  The  degrees  of  freedom 
parameter, 

rD  =  2tr  (P&IM) 
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=  2tr(Pm)--|tr(PmYfC6-t1YtPm) 

=  2tr  (l„  -  jllr)  -  -|tr  (Y6PmYf Ci1) 

=  2  (M  -  1)  -  2tr  (Iat_i) 

=  2(M  -  N).  (95) 

Thus,  conditioned  on  the  non-signal  data,  Wp  is  distributed  as 

WD  ~  Y|(m_w)  (0) .  (96) 

The  ratio  of  a  non-central  Chi-squared  random  variable  scaled  by  its  degrees  of 
freedom  to  a  central  Chi-squared  random  variable  scaled  by  its  degrees  of  freedom 
is  distributed  as  a  singly  non-central  Fisher’s  /  distribution  when  the  two  random 
variables  are  independent.  As  seen  in  Searle  [25]  and  Theorem  A. 2  of  Appendix  A, 
quadratic  forms  of,  respectively,  a  real  or  complex  Gaussian  vector,  are  pairwise 
independent  if  the  E  product  of  the  matrices  is  zero,  where  bSig  is  the  covariance 
matrix  of  the  Gaussian  vector  (which  is  the  identity  AEB  =  0,  where  A  and  B 
are  the  matrices  involved  in  the  quadratic  forms). 

Consider  the  E  product  of  the  matrices  in  the  quadratic  forms  of  W, 

PfcljvrPdPf  ex  P m  1M  -  j^^bCbbYb  Pm  Im  - -^PmYf  lpf 

=  Pm  Im  —  Pml  —  — PmYb  C6fc  Yf,l  pb 

a  [pm  -  PmYfC,;i1Ytlpf 

=  PmYfC^Ytl  -  -jtpmYfCr11YiPmYfCS1Yil]  pf 
=  PraYf  C^'Yfel  -  P^YfC^Ytl]  pf 
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=  0.  (97) 

Thus,  conditioned  on  the  non-signal  data,  Wn  and  Wd  are  independent  and  W  is 
the  scale  of  a  non-central  Fisher’s  /  distribution.  The  distributional  parameters 
of  the  numerator  and  denominator  of  W  only  depend  on  the  non-signal  data 
through  the  non-centrality  parameter  of  the  numerator,  equation  (89),  in  the 
form  of  the  variable  p.  The  probability  density  function  of  p  is  determined  as 
follows: 


where 


and 


1  +  y f  C&&V& 

2yfy  b 

_ yf  (mCh)'1^ _ 

u 

u  +  v’ 


2yfyt 

yf 


(98) 


(99) 


V  =  2yf  (Afl)yt.  (100) 

Utilizing  Theorem  A. 5  of  appendix  A,  U  is  seen  to  have  a  central  Chi-squared 
distribution  with  2 (M  —  N  +  1)  degrees  of  freedom, 

U  ~  ^2(M-N+l)i  (101) 

and  is  independent  of  y&,  because  the  sample  mean  vector,  and  the  sample 
covariance  matrix,  C^,  are  independent  and  M Cm  ~  CWn-i  (M  —  1,Ijv-i)- 
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From  Theorem  A.  1  of  appendix  A,  V  has  a  central  Chi-squared  distribution  with 
2 (N  —  1)  degrees  of  freedom, 

V  ~  •*!(„_,),  ,  (102) 

because  yb  ~  CA'iv-i  (o,  and  (Afl«-i)  is  idempotent  with 

trace  equal  to  N  -  1.  The  central  Chi-squared  distribution  results  from  the 
non-centrality  parameter,  the  quadratic  form  (MI)  0,  being  zero.  Since  U 
and  y6  are  independent,  U  and  V  are  independent.  Thus,  using  Theorem  A.6  of 
Appendix  A,  (98)  is  beta  distributed, 

p  ~  (3eta  (M  —  N  +  1,  N  —  1) ,  (103) 


with  probability  density  function, 


f(p) 


_ *  _ 

r(N-l)T{M-N  +  iy 


A 


(104) 


for  0  <  p  <  1.  Scaling  the  numerator  and  denominator  of  W  by  the  degrees  of 
freedom  of  their  Chi-squared  distributions  yields  a  Fisher’s  /  distribution.  Thus, 
conditioned  on  p, 


Zglr 


wN 

2 


Wn 


(M  —  N)W  ~  fip(M-N)  (2 Mps) , 


(105) 


where  the  notation  X  ~  fa,b  (c)  implies  that  the  random  variable  X  has  a  singly 
non-central  Fisher’s  /  distribution  with  degrees  of  freedom  parameters  a  and  b 
and  with  non-centrality  parameter  c. 
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3.2  AMF  Statistic 

As  described  by  Robey  [14],  the  adaptive  match  filter  (AMF)  statistic  is  cre¬ 
ated  by  forming  a  GLR  statistic  for  the  unknown  complex  signal  power  under 
the  assumption  that  the  interference  covariance  matrix  is  known.  An  estimate  of 
the  interference  covariance  matrix  that  is  statistically  independent  of  the  GLR 
statistic  is  then  substituted  into  this  form.  Robey  considered  the  same  primary¬ 
secondary  data  scenario  as  Kelly  [13],  as  mentioned  in  section  3.1.1,  where  the 
secondary  data  is  assumed  to  contain  interference  only  and  to  be  independent 
of  the  primary  data.  In  the  sequential  block  problem,  it  is  seen  that  the  GLR 
statistic  derived  in  the  above  fashion  is  a  function  of  the  sample  mean  vector. 
Thus,  the  sample  covariance  matrix,  which  is  independent  of  the  sample  mean 
vector  in  the  Gaussian  case,  may  be  used  as  the  interference  covariance  matrix 
estimate.  It  should  be  noted  that,  in  the  case  of  the  deterministic  signal  model, 
the  sample  covariance  matrix  is  not  the  maximum  likelihood  estimator  of  the 
interference  covariance  matrix. 

3.2.1  Derivation 

The  GLR  statistic  for  the  unknown  signal  power  as  a  function  of  the  assumed 
interference  covariance  matrix  has  the  form 

max  /  (X|0,  S) 

a  amf  (£)  =  ee&j  pqs)  ■  (106) 
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Prom  equation  (21),  the  likelihood  function  of  the  data  in  the  block  may  be 
described  as 

/ 


/(X  |0,E)  = 


1 


7T^|S 

/ 


•etr 


-E 


-l 


exp 


-d^S_1d 


V 


S  +  xx^-fldx* 


-9*xdH  +  99*ddH 


*  ax.H'E~1d 


-i  \  -i  Af 


n 


(107) 


69*  -9 


d«E-id 


X 


1 


exp 


♦  d^E-1* 

c 

d^E"1* 


md^E-^x  , 
y  d^E_1d  + 

2 


dHS-1x  2 
d«E-!d 


V 


TT^IS, 

exp  I  -d^S-xd 


d^E^d 

d^E"1*'2 


-x«S-1x-tr(s-*S)J 


“i  M 


6- 


d^E^d 


x- 


1 

f  dHE_1x 

2  \1 

|S| exp 

dHE_1d 

-  -  x^E-'x  -  tr  (E-XS) 

n  M 


(.108) 


Maximizing  this  with  respect  to  9  occurs,  as  seen  in  (108),  when 

§(s)=dH s'lx 


(109) 


d^S-'d’ 

because  the  quadratic  form,  dHE_1d,  is  always  positive  since  E  is  a  positive 
definite  matrix.  The  likelihood  function  of  the  data  in  the  block  when  no  signal 
is  present  is  exactly  equation  (107)  when  9  —  0, 


/(X  |E)  = 


7T 


1  ~\M 

wTsT  exp  (_tr  (s_1s)  -  x^s-1*) 


(110) 


Substituting  (108)  -  (110)  into  the  generalized  likelihood  ratio  of  (106)  results  in 

,2\  m 


h-AMF  (S)  =  exp 


dFE-xx 


dFE_1d 


(111) 


/ 
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The  sample  covariance  matrix,  S,  is  now  substituted  into  a  one-to-one  transfor¬ 
mation  of  (111),  resulting  in  the  AMF  statistic 


Zamf  —  log  [-A-am-f  (S)m]  — 


d*S 


-l 


x 


dHS-1d 


(112) 


3.2.2  Conditional  Distribution 


In  section  3.1.2,  the  distribution  of  the  GLR  statistic  was  determined  con¬ 
ditioned  on  a  set  of  non-signal  data.  The  effects  of  the  conditioning  were  rep¬ 
resented  as  a  Beta  distributed  random  variable.  The  distribution  of  the  AMF 
statistic  may  similarly  be  determined  by  conditioning  on  the  sample  covariance 
matrix.  Using  the  whitening  transformation  described  by  equations  (43)  to  (54), 
the  AMF  statistic  may  be  written  as 

e^C-iyf 
e^C-iei 

eigC~2e1  ^  /MC-1e1e1gC-1  \ 

2MeiHC"1ei  Y  V  e^C^ex  ) 

— 2yHPy,  (113) 

pr 


Zamf  — 


where 


and 


(e^C-'ex) 
p  ~  e^C-ie, 
2  M 

r  ~  e1ifC-1e1’ 


=  M 


C^exe^C-1 

ex^C-2ex 


(114) 

(115) 


P 


(116) 


Writing  r  in  the  form 


39 


r  =  2 


ei^Ijvei 


eiH  [M Cylex 


(117) 


and  noting,  as  seen  in  (54),  that  MC  has  a  complex  Wishart  distribution  with 
M  —  1  degrees  of  freedom  and  the  N  dimensional  identity  matrix  as  a  scale 
parameter  and  using  Theorem  A.5  of  appendix  A,  r  is  seen  to  have  a  central 
Chi-squared  distribution  with  2  (M  —  N)  degrees  of  freedom, 


x  r  ~  ^2(M-N)-  (H8) 

Based  on  the  results  of  Reed,  Mallet  and  Brennan  [12],  the  parameter  p  (equation 
29  of  [12])  has  a  Beta  distribution, 


p~(3eta  (M  —  N  +  1,N  —  1) . 


(119) 


Both  p  and  r  are  formed  from  the  sample  covariance  matrix,  leading  one  to 
believe  that  they  may  be  correlated.  Utilizing  the  data  segmentation  of  equation 
(57)  and  the  matrix  partitioning  of  equations  (63)  and  (64),  the  quadratic  forms 
describing  p  and  r  may  be  expressed  as 

eiHC-1ei  =  daa,  (120) 

and 

e1HC_2ei  =  dL  +  d^dfca 

=  d2aa  (l  +  cgC^cba)  . 


(121) 
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Substituting  these  forms  into  p  and  r  results  in 

r  =  2Md;Z,  (122) 

and 

p=(l+cgC^cia)~1.  (123) 

Using  equation  (65)  and  the  first  result  of  Theorem  A.3  of  appendix  A,  which 
states  that  d~ 0X  =  Caa.b  is  independent  of  cba  and  Cbb,  it  is  seen  that  p  and  r  are 
indeed  independent. 

Noting  that  y  has  a  covariance  matrix  equal  to  A? I n,  it  is  clear  that  its 
product  with  the  matrix  of  the  quadratic  form  in  (113)  is  the  outer  product  of  a 
unit  length  vector  with  its  conjugate  transpose, 

J_  C-Sei^C-1 

M  N  ei»C-2ei 

C-'e,  \  /  C-'e,  \ 

C-'^llJ  ll|C->ei||J 

and  is  thus  idempotent.  The  notation 

||x||2  =  Vx^x,  (125) 

is  used  to  describe  the  two-norm  of  a  vector.  Applying  Theorem  A.l  of  ap¬ 
pendix  A,  the  scaled  quadratic  form  in  equation  (113)  has  a  non-central  Chi- 
squared  distribution  with  two  degrees  of  freedom, 


2yaPy~XpS), 


(126) 


41 


and  non-centrality  parameter 


6 


2  M 


(e^CT'e,)2 

e^C^ei 


2  Mps. 


(127) 


The  AMF  statistic  may  now  be  described  as  the  scale  of  a  non-central  Chi-squared 
random  variable  conditioned  on  independent  Beta  and  central  Chi-squared  dis¬ 
tributed  random  variables, 

Zamf  =  — V,  (128) 

pr 

where 

V  ~  X%  (2 Mps) ,  (129) 

and  r  and  p,  respectively,  have  distributions  as  described  in  equations  (118)  and 
(119).  The  conditioning  on  the  central  Chi-squared  random  variable,  r,  may  be 
removed  in  the  following  manner: 


fz(z\s,p)  =  Er[fz{z\s,P,r )] 


=  E. 


E r  [prfv  (prz\s,  p,  r)] 

e"i  (if  {prz)k  e~^2 


pr  k\  2fe+1T  (k  +  1) 


f*  e-l  (I)"  (pzf 

Pto  k]  2k+M{k+l)  r 


rk+le-prz/2 


(130) 


where  fv  (v| s,p,r)  is,  as  described  in  (129),  a  non-central  Chi-squared  distribu¬ 
tion  with  two  degrees  of  freedom  and  a  non-centrality  parameter  6  =  2 Mps.  The 
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resulting  expectation  over  r  may  be  simplified  by  utilizing  the  PDF  of  a  Gamma 
random  variable,  requiring  the  general  form 


Er 


jJ  e~tT 


-L 

-L 


r^e  trfr  (r)  dr 

r= 0 

oo  rM-N-\+j  ~(t+\)r 


-dr 


=o  r  (M  -  N)  2m~n 

r  {M-N+j)(t+i)-(M-N+j) 
r(M-N)2M~N 

w  roo  rM-N+j- ie-(t+i)r 

x  Jr=0  r (M-N+j)(t+±rM-N+* 
T(M  —  N  +  j)  V 
r  (M-N)  (l  +  2t)M~N+j' 


dr 


(131) 


where  it  is  recognized  that  the  integral  in  the  third  line  of  (131)  is  the  integral 
of  the  probability  density  function  of  a  standard  Gamma  random  variable  and, 


thus,  equals  one.  Substituting  this  result  into  equation  (130)  yields, 

u(ztso)  =  ^(ffrtM-AT  +  fr  +  i) 

Tz[  ,p)  p^0  k\  r{M~N)r(k  +  i)(i  +  Pz)M-N+k+1 

U2  \  g  e-f  (f)*  r  +  k )  nf  nt+k  (%pz) 


(132) 


«  r(f  +  /c)r(f)[n2  +  nife^ 

where  n\  =  2  and  n2  =  2  (M  —  JV).  The  last  line  of  (132)  may  be  recognized  as 
the  scale  of  a  non-central  Fisher’s  /  distribution  with  degrees  of  freedom  rii  and 


n2  and  non-centrality  parameter  6.  Thus,  conditioned  on  the  random  variable  p, 
the  AMF  block  level  statistic  may  be  described  statistically  by 


JAMF 


(M-N)p 


U, 


(133) 


where 


U  ~  h,2(M-N)  (2Mps) , 


(134) 
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and,  as  previously  described,  p  is  Beta  distributed. 


Chapter  4 


Detector  Non-linearities 


Once  the  data  in  the  block  has  been  compressed  to  a  univariate  statistic, 
it  is  possible  to  apply  detector  nonlinearities  that  may  improve  the  sequential 
detection  performance.  As  seen  in  sections  3.1.2  and  3.2.2,  the  conditional  dis¬ 
tributions  of  the  GLR  and  AMF  statistics  are  functions  of  a  scalar  quantity  s, 
the  signal-to-interference  ratio.  Assuming  that  the  signal-to-interference  ratio  is 
known,  the  optimal  detector  non-linearity  has  the  form  of  a  log-likelihood  ratio,  as 
described  in  section  2.2.  When  the  signal  strength  is  not  known,  and  weak  signals 
are  of  interest,  the  locally  optimal  detector  non-linearity  is  indicated  as  a  reason¬ 
able  solution,  as  described  in  section  2.2.  In  this  chapter,  the  optimal  detector 
is  proposed  for  a  specified  value  of  signal-to-interference  ratio,  termed  the  design 
SIR,  and  the  locally  optimal  detector  is  derived  for  the  GLR  and  AMF  statistics. 
In  order  to  apply  Page’s  test  to  the  locally  optimal  detector  non-linearities,  a  bias 
must  be  applied  so  that  the  mean  of  the  detector  non-linearity  is  negative  when 
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no  signal  is  present.  It  is  seen  that  the  bias  may  be  chosen  to  equivalently  mini¬ 
mize  the  signal-to-interference  ratio  required  to  achieve  a  prescribed  asymptotic 
efficacy  or  to  maximize  the  asymptotic  efficacy  for  a  prescribed  SIR. 

4.1  Optimal  Detector  Non-linearity  for  a  Design  SIR 

Designing  a  detector  that  is  optimal  for  a  specified  SIR  may  yield  a  test  that 
has  adequate  performance  over  a  wider  range  of  SIR.  The  log-likelihood  ratio 
form  of  the  detector  non-linearity  requires  the  evaluation  of  the  unconditioned 
probability  density  function  (PDF)  of  the  block  level  statistic  under  the  null 
hypothesis  ( H ),  where  s  =  0,  and  under  the  alternative  hypothesis  (K)  for  s  =  s, 
where  s  is  the  design  signal-to-interference  ratio, 

i!(z)=iogftiS]’  (135) 

where  fz  (z\s)  describes  the  unconditional  distribution  of  the  block  level  statistic, 
Z ,  for  a  signal-to-interference  ratio  equal  to  s. 

The  evaluation  of  the  unconditioned  PDF  of  the  block  level  statistics  requires 
evaluating  a  one  dimensional  integral  of  the  non-central  Fisher’s  /  density  func¬ 
tion,  which  in  turn  is  represented  by  an  infinite  summation.  As  described  in  [26] 
there  exist  several  approximations  to  the  cumulative  distribution  function  of  a 
non-central  Fisher’s  /  random  variable.  The  most  reasonable,  in  terms  of  compu¬ 
tational  requirements  and  accuracy,  seems  to  be  the  Three-Moment  Approxima¬ 
tion  proposed  by  Tiku  [27].  Thus,  the  unconditional  PDF’s  are  evaluated  using 
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Tiku’s  Three-Moment  Approximation  and  numerical  integration,  as  described  in 
appendix  B. 


4.2  Locally  Optimal  Detector  Non-linearity 


The  locally  optimal  detector  non-linearity, 


9{z)  = 


Tsfz  Ws) 


J  5=0 


fz(z\s  =  0) 


(136) 


requires  forming  the  partial  derivative  of  the  block  level  statistic  PDF  when 
a  signal  is  present  with  respect  to  the  signal-to-interference  ratio  followed  by 
normalization  by  the  block  level  statistic  PDF  when  no  signal  is  present. 


4.2.1  GLR  Statistic 

The  unconditional  distribution  of  the  GLR  block  level  statistic  may  be  de¬ 
scribed  as 


fz(z\s)  =  Ep\fz{z\s,p)],  (137) 

where,  as  described  in  (105),  the  PDF  conditioned  on  knowledge  of  p,  fz  (-z|s,  p), 
is  a  non-central  Fisher’s  /  distribution.  Taking  the  partial  derivative  with  respect 
to  s  inside  the  expectation  over  p  and  applying  the  chain  rule  for  derivatives 
results  in 


d_ 

ds 


fz  (z\s)  =  E p 


9  .  f  \  ,06' 


(138) 
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where  6  —  2 Mps  is  the  non-centrality  parameter  of  the  Fisher’s  /  density.  The 
conditional  PDF  of  the  GLR  statistic  may  be  written  as 


.  T(M-N  +  k  +  l)  (M  -  N)m~k  zk 

fz(z\s,p)  A,  k,  T{M-N)T(k  +  l)[M-N  +  z)M~N+k+v 


from  equation  (105)  and  appendix  C.  Taking  the  partial  derivative  of  the  condi¬ 
tional  PDF  with  respect  to  the  non-centrality  parameter,  6,  requires  the  derivative 


forms 


d  _s  k 
8Se  16  ~ 


e  2  8k  1  (k  —  0  k  >  1 


-V  k  =  0 


(140) 


which  are  evaluated  at  S  =  0.  Thus,  k  =  0  and  k  =  1  are  the  only  non-zero  terms 
in  the  infinite  sum, 


-Jz(z\s,p) 


(. M  -  N)M-N 
2F  (M  -  N) 


r(M-N+2)z 

(. M-N+z)M~N +_i 


r(M-jy+i) 


{M-N) 


M-N+l 


(M-N  +  l)z 


2(M-N  +  z)M~N+1  L  M-iV  +  2: 

(M  -  AT)M~iV+1  /  z-1  \ 
2(M-N  +  z)m-n*'  U  +  ' 


(141) 


Combining  this  result  with 


(142) 


in  equation  (138)  evaluated  at  s  =  0  results  in 


,rEr 

J  s“°  L\i+  M-N ) 


Mp  (  z -  1 

sM-N+l 
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MEP  \p] 


z-1 


M-N+l 


(l  +  m-n) 

M-N+l  ( 

(\  -f - z\m~N+1  ( 

V1  ^  M-N) 

where  the  mean  of  p  has  the  form  (see  appendix  C) 


1  + 


M-N  . 

z-1 


1  + 


M-N . 


(143) 


Ep  \p]  = 


M-N  +  l 
M 


(144) 


Setting  6  =  0  in  (139)  and  taking  the  expectation  over  p ,  as  described  in  (137), 
results  in 


fz  (z\s  =  0) 


Ep 


T(M-N+1)  ( M-N)m~n 
T(M-N)  ( M-N  +  Z)M~N+1 

1 


Z 

M-N 


) 


M-N+l  ■ 


(145) 


Here  it  is  seen  that  the  unconditional  PDF  of  the  GLR  block  level  statistic,  when 
no  signal  is  present,  is  a  central  Fisher’s  /  distribution.  Now,  forming  the  locally 
optimal  non-linearity  of  equation  (136)  yields 


g(z)  =  (M-N  +  l)— j .  (146) 

As  seen  in  Broder  [9],  the  asymptotic  efficacy  is  invariant  to  scale  changes  of  the 
detector  non-linearity.  Thus,  the  non-linearity  of  equation  (146)  may  be  scaled 
so  that  it  approaches  one  as  2  approaches  infinity.  This  results  in  the  form 


z-1 


9glr  (z)  — 


z  +  M-N' 


(147) 


49 


4.2.2  AMF  Statistic 


The  unconditional  probability  density  function  for  the  AMF  block  level  statis¬ 
tic  may  be  described  as  in  equation  (137).  The  locally  optimal  detector  non¬ 
linearity  may  then  be  found  by  evaluating  the  partial  derivative  of  the  conditional 
distribution  of  the  block  level  statistic  as  described  in  equation  (138).  The  con¬ 
ditional  distribution  of  the  AMF  statistic,  as  seen  in  equation  (132),  is  a  scaled 
non-central  Fisher’s  /  distribution.  Taking  the  partial  derivative  with  respect  to 
the  non-centrality  parameter  and  setting  it  equal  to  zero  yields  results  similar 
to  that  of  the  previous  section  where  the  k  =  0  and  k  =  1  terms  are  the  only 
non-zero  terms  in  the  summation, 


d_ 

ds 


fz(z\s,p ) 


J  s=0 


2F  (M  -  N) 


r(M-N  +  2)pz  T{M-N+1 ) 


(1  +  pz) 


M-N+2 


(. M-N ) 


{M-N+  1)  zp2 


(1  +  pz) 

P 


M-N+l 


(148) 


L  (1  +  pz)M~N+ 2  (1  +  pz)M~N+1 . 

Combining  this  result  with  equation  (142)  in  equation  (138)  evaluated  at  s  =  0 


results  in 


d_ 

ds 


fz  (*|s) 


=  M  (M  -  N) 


Js=0 


(M-N  +  l)  zEc 


„3 


(1  +pz) 


M-W+2 


-E„ 


(1 +pz)M~ti+1 


=  M(M-N)[(M-N  +  l)zh3t2(z)-h2>1(z)},  (149) 


where 


hi,j  (z)  — 


l 


(1  +  pz)M~N+j_ 

T(M)  pM~N+i  (1  -  p)N~2 
p=o  r  [n  - 1)  r  (m  -  n  + 1)  (i  +  pz)M~N+j 


-dp.  (150) 
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By  setting  s,  and  therefore  <5,  equal  to  zero  in  the  conditional  PDF  of  the  AMF 
statistic  found  in  equation  (132),  and  taking  the  expectation  with  respect  to  p, 
the  unconditional  PDF  when  no  signal  is  present  is  formed, 


fz  (*|s  =  0) 


(M  -  N)  Ep 


(1  +  pz)M~N+1 


C M-N )  hltl  (z). 


(151) 


Placing  equations  (149)  and  (151)  into  equation  (136),  the  AMF  locally  optimal 
non-linearity  is  formed, 


g  (z)  =  M  (M  -  N  +  1)  2^44  -  Mjrr\- 
y  J  v  h1A  {z)  h1A  (z) 


(152) 


As  with  the  GLR  locally  optimal  non-linearity,  the  AMF  non-linearity  is  scaled 
so  that  its  limit  is  one  as  z  goes  to  infinity.  It  is  first  recognized  that 


lim  ZM-N+jhitj(z)  =  lim  E 

-r w /■%/">  3j  v  '  7. — ►no 


=  E 


L(^  +  ^) 

— (M— JV— i+i)l 


M—N+j 


(153) 


Using  the  probability  density  function  of  the  standard  Beta  distribution,  the 
negative  moments  of  p  are  seen  to  be 


when 


T  (M)  T  (Af  -  N  -  K  +  1) 
-  T  (M  -  K)T  (M  —  N  +  1) 


(154) 


K  <M-N+  1. 


(155) 
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Applying  equations  (153)  and  (154)  to  the  limit  of  equation  (152)  as  z  — »  oo 
results  in 


lim  g  (z)  =  lim  M 


1  +  L)  zV-Whw  {z)  zM~N+lhltl  (. z ) 


=  M 


(M-N  +  l) 


E  \p-{M-N-l) 


E 


1) 


E  [p~(M-N)}  E  [p-(M-N)] 


M  (M-N) 
N 


(156) 


where 


E 


-{M-N- 1) 


E[p-W-N)} 


r  (M)r  (2)r  {N)r  (m  -  n  + 1) 
r  (a  + 1)  r  (m  —  w  + 1)  r  (M)  r  (i) 
i 

N' 


(157) 


Scaling  equation  (152)  by  the  inverse  of  equation  (156)  results  in  the  locally 
optimal  non-linearity 


9amf  {*)  — 


N 


M-N 


(M-N  +  l)z 


h3p  (z)  h2,i  (z) 


(158) 


'hxp(z)  h1A(z)\' 

In  general  the  locally  optimal  non-linearity  for  the  AMF  statistic  is  not  strictly 
increasing.  Thus,  although  it  approaches  one  in  the  limit,  the_  maximum  may 
actually  be  greater  than  one.  The  hid  (z)  functions  do  not  pose  serious  imple¬ 
mentation  problems  as  they  may  be  easily  evaluated  using  numerical  integration 
and  stored  in  tables. 


4.3  Optimal  Bias  For  Page’s  Test 

As  described  in  chapter  2  ,  when  a  detector  non-linearity  is  applied  to  Page’s 
test,  the  mean  of  the  non-linearity  must  be  negative  when  no  signal  is  present  and 
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positive  otherwise.  A  log-likelihood  ratio  non-linearity  satisfies  this  requirement, 
however,  a  locally  optimal  non-linearity  may  not.  Stahl  and  Willett  [10],  following 
Dyson  [8],  have  approached  this  problem  by  subtracting  a  bias  proportional  to 
the  square  of  the  signal  strength.  The  proportionality  constant  is  chosen  by 
maximizing  the  fourth  derivative  of  the  asymptotic  efficacy  as  the  signal  strength 
tends  to  zero. 

An  alternative  solution,  proposed  herein,  is  to  choose  the  subtractive  bias 
that  achieves  a  desired  asymptotic  efficacy  with  a  minimum  signal  strength.  In 
actual  detector  operation,  a  particular  performance  is  typically  desired  and  the 
strength  of  an  observed  signal  is  rarely  known  a  priori.  Thus,  tuning  the  detector 
to  yield  the  desired  performance  at  a  minimum  signal  strength  is  an  appealing 
approach.  It  will  be  shown  that  this  method  is  equivalent  to  choosing  the  bias 
that  maximizes  the  efficacy  given  a  fixed  signal-to-interference  ratio.  Both  the 
proposed  method  and  that  of  Stahl  and  Willett  result  in  a  minimum  detectable 
signal  level,  which  is  the  consequence  of  a  negative  detector  non-linearity  mean 
when  signals  below  the  detectable  level  are  present. 

4.3.1  Derivation  of  the  Optimal  Bias 

Let  gT  (x)  be  the  detector  non-linearity  to  be  applied  to  Page’s  test, 

9r  0)  =  9o  (x)  -  r,  (159) 

where  r  represents  the  applied  bias  and  go  ( x )  is  the  original  non-linearity.  It  is 
assumed  that  the  non-linearity  is  operating  on  a  univariate  test  statistic  modeled 
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by  the  random  variable  X  with  probability  density  function  fx  (x).  Requiring  the 
mean  of  gT  (X)  to  be  negative  when  no  signal  is  present  and  positive  otherwise 

yields 

E0  [gT  (X)}  <0<ES  [gT  (X)} ,  (160) 

where  the  subscripts  0  and  s  on  the  expectations  respectively  represent  the  no 
signal  and  signal  strength  equal  to  s  cases.  Applying  the  bias  form  of  the  non¬ 
linearity  results  in 

E0  [go  ( X)}  -r<Q<Es[g0  (X)]  -  r,  (161) 

or 

E0  [go  (X)]  <  r  <  Es  [g0  (X)] .  (162) 

Equation  (161)  illuminates  the  minimum  detectable  signal  phenomenon  where, 
for  a  specific  value  of  r  >  0,  there  will  be  some  sMdl  for  which  only  s  >  sMdl 
will  satisfy  the  right  hand  inequality  of  (161). 

The  detector  non-linearity,  gr  (x),  is  now  applied  to  the  definition  of  the 
asymptotic  efficacy  (12), 

V  =  tT Es  [gT  (X)] 

=  tT  (Es  [<7o  (X)]  —  r) ,  (163) 

which  may  be  solved  for  the  mean  of  the  unbiased  non-linearity  when  signal  is 
present  to  yield 

e»  b»  mi  =  f + t, 


(164) 


54 


where  tT  is  the  non-zero,  unity  root  of  the  moment  generating  function  of  the 
detector  non-linearity  with  bias  r.  Thus,  tT  satisfies 


1  —  E0 


J't9t(X) 


Substituting  in  the  bias  form  of  the  non-linearity  results  in 


1  =  E0  Jetr(9°(x)-r)' 


=  e_4rTEn 


JTg0(X) 


which  may  be  solved  for  r  to  yield 

logEo 


(165) 


(166) 


(167) 


Substituting  this  form  of  r  into  equation  (164)  yields 


Es  [5o  (^)]  = 


7]  +  logEo 


^•r9o{X) 


tT 


(168) 


Under  the  assumption  that  there  is  a  one-to-one  relationship  between  the  bias, 
r,  and  the  corresponding  moment  generating  function  root,  tT ,  minimizing  the 
mean  of  the  unbiased  detector  non-linearity  when  signal  is  present  with  respect 
to  r  may  be  equivalently  performed  by  minimization  with  respect  to  tT.  There 
is  also  an  implicit  assumption  that  Es  [50  (X)]  is  monotonically  increasing  in  s  so 
maximizing  the  latter  is  equivalent  to  maximizing  the  former.  Proceeding,  the 
partial  derivative  of  (168)  with  respect  to  tT  is  seen  to  be 


^e,[9oW]  =  I 


t2r 


tr  -n  r  +  V  Eg 


Eo  [etr9°(x)] 


str9o{X) 


(169) 
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which  is  equal  to  zero  when  tT  satisfies 


T)  —  tT 


E0  [gp  (X)e^x\ 


—  log  E0  \eU9o{x) 


(170) 


,/_bT  E0[e^oW]  L  J  v  ' 

To  show  that  this  is  a  minimum,  consider  the  second  derivative  with  respect  to 

tTi 


~E,  [j„  (X)] 


2,  1  [  Eo[go(X)e^>1  _  r„oOTi 

Tr  ti  [  T  Eo[e‘-9"W]  2  g  I  ( 

tr Eo  Eo  [gg  (X)e^»'x>] 

t^Eo  [e(T9°W]2 

Ep  [9o  (X)  e‘Tg0^)j  (Eo  [e^°W]  +  trE0  jgp  (X)  e‘^W]) 
t^Eo  [etrS°W]2 

1  [e0  \gl  (X)  e^W]  /Eo  \g0  (X)  e^°  W]  \  2' 

Tt  Eole^soWj  {  E0[e*^oW]  J 

21  [  E»[,o(X)e<^)]  |  P  LirgoPOll 

Tff  T  Eo  [<*»<*>]  ”  8  °l  J 


Eo 

gl  (X)  etr9°W 

(E0 

)1 

Eq  [etTS°W] 

^  Eo  [e^50^)] 

J  J 

(171) 


Noting  that  the  first  derivative  is  equal  to  zero  when  tT  satisfies  equation 
(170),  convexity  is  assured  at  this  tT  when 


Ep  \g2Q  (X)  e^° W]  (  E0  [g0  (X)  e*^ 


E0  [etr90^x)] 


E0  [etr9o(x)] 


(172) 


or,  equivalently, 


Eo  [e‘'M(x)]  E0  [gg  (X)  e1'9”™]  >  E„  [g0  (X)  e‘'9”w] 2 . 


(173) 
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This  inequality  is  shown  to  be  true  by  using  the  integral  form  of  the  Schwarz 
inequality,  as  found  in  Hardy  [28], 


J  u2  (x)  dx  J  v2  ( x )  dx>  (^J  u  (x)  v  (x)  dx 


(174) 


where  equality  holds  only  if  the  non-negative  functions  u  and  v,  when  appropri¬ 
ately  scaled,  are  equivalent1  , 


Au  (a:)  =  Bv  (x) , 


(175) 


where  A  and  B  are  not  both  equal  to  zero.  Applying  the  Schwarz  inequality  to 
equation  (173)  where 

u(x)  =  [eU9o{x)fx(x)f  (176) 


and 

(177) 

yields 

E  [etT5oW]  E  [g2  (X)  eU9o{x)]  >  E  [\g0  (X)  \  eU9o{x)] 2 

>  E[g0(X)eU9°W]2 ,  (178) 

thus  proving  convexity  and  indicating  that  choosing  tT  to  satisfy  equation  (170) 
minimizes  the  mean  of  the  unbiased  detector  non-linearity,  (168).  The  equality 
in  equation  (178)  will  hold  only  when 

9o  (x)  =  c,  (179) 

xTwo  functions  are  equivalent  if  they  are  equal  except  on  a  set  of  zero  measure. 
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where  c  is  any  real  constant,  which  is  not  a  reasonable  detector  function  for  the 
block  level  statistics  considered  in  this  dissertation. 

Now,  supposing  that  the  signal-to- interference  ratio,  and  thus  Es  [<?0  (X)],  is 
fixed,  the  bias  may  be  chosen  to  maximize  the  asymptotic  efficacy  of  equation 
(163).  Consider  the  partial  derivative  of  equation  (163)  with  equation  (167)  with 
respect  to  tT 


Xn  =  E.MX)]- 


EoLpOe^V 


E0  [e^oPO] 


(180) 


which  is  clearly  set  to  zero  by  choosing  tT  to  satisfy 

E0  \g0(X)e^x'> 


E.[a>(X)]  = 


E0  [e^soPO] 


(181) 


That  this  choice  maximizes  the  asymptotic  efficacy  is  insured  by  noticing  that 
the  second  derivative 


d2  E0  [gl  (X)  e^°W]  E0  [>*»<*>]  -  E0  [go  {X)  e^oW 

atf77  =  E0  [e^Wf 

<  0, 


1  2 


(182) 


which  provides  concavity.  The  strict  inequality  follows  from  equation  (178)  where 
the  detector  non-linearity  is  not  allowed  to  have  the  form  seen  in  equation  (179). 

The  equivalence  of  the  two  methods  described  for  choosing  the  moment  gener¬ 
ating  function  root,  and  thus  the  detector  bias,  may  be  argued  using  a  graphically 
oriented  proof  by  contradiction.  The  equivalence  is  formulated  by  the  following 
theorem  where  A  =  Es  [go  (X)]: 
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Theorem  4.1  If  the  detector  bias  is  chosen  at  r  =  r*,  to  minimize 
A  for  a  fixed  value  of  the  asymptotic  efficacy,  rj  =  rf,  yielding  the 
minimum  A*,  then  t*  also  maximizes  77  over  all  valid  biases,  r,  for  the 
fixed  value  of  the  unbiased  detector  non-linearity  mean  when  signal 
is  present,  A*. 


Proof:  Suppose  that  r*  does  not  maximize  the  asymptotic  efficacy,  rj.  Then  there 
exists  a  bias,  f^r*,  such  that 

V,  (A*)  >  r,T.  (A*) ,  (183) 

where  r)T  (A)  is  the  asymptotic  efficacy  for  the  detector  with  bias  r  and  non¬ 
linearity  mean  A  when  signal  is  present.  Using  equation  (163)  this  may  be  de¬ 
scribed  graphically  as  seen  in  figure  3. 


Figure  3:  Efficacy  versus  detector  non-linearity  mean  when  signal  is  present. 


Equation  (183)  results  in 


tf  (A*  —  f)  >  iT.  (A*-t*). 


(184) 
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Now,  consider  the  value  of  A  that  sets  the  asymptotic  efficacy  to  77*  for  the  bias 
T 

Vf  (a)  =  tf(x-  f) 

=  rf 

=  tT.  (A*  -  r*) .  (185) 

This  is  represented  by  point  A  in  figure  3.  Combining  equations  (184)  and  (185) 
results  in  the  inequality 

tf  (A*  -  f )  >  tf  (A  —  rj 

X*  >  A,  (186) 

which  is  evident  from  the  graph  of  figure  3  when  tf  >  0.  This  result  contradicts 
the  assumption  that  the  bias  r*  was  chosen  to  minimize  A  over  all  biases  that 
yield  asymptotic  efficacy  equal  to  rf.  Thus,  choosing  the  bias  to  minimize  the 
detector  non-linearity  mean  when  signal  is  present  for  a  fixed  asymptotic  efficacy 
is  equivalent  to  choosing  the  bias  to  maximize  the  asymptotic  efficacy  for  a  fixed 
detector  non-linearity  mean  when  signal  is  present. 

4.3.2  Implementation  Concerns 

In  practice,  the  asymptotic  efficacy,  a  function  of  the  mean  time  between 
false  alarms  and  the  mean  delay  to  detection,  would  be  prespecified.  From  the 
desired  efficacy,  equation  (170)  is  inverted  to  determine  the  required  value  of  tT. 
This  value  of  the  moment  generating  function  root  is  used  to  determine  the  bias 
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required  to  achieve  the  desired  efficacy  by  substitution  into  equation  (167).  As 
equations  (167)  and  (170)  are  not  of  simple  form,  it  is  not  clear  if  there  exists  a 
one-to-one  transformation  from  r]  to  tT  or  if  equation  (167)  exists  for  reasonable 
tT. 

From  equation  (170),  when  tT  =  0,  rj  =  0  as  well.  If  rj  is  a  monotone,  strictly 

increasing  function  of  tT ,  then  for  each  rj  >  0  there  exists  a  unique  tT  >  0  that 

satisfies  equation  (170).  To  show  that  g  is  strictly  increasing  in  tT,  the  partial 

derivative  is  shown  to  be  positive, 

dv  E0  \g0  (X)  e**>  <*>]  +  frE0  \g20  (X)  e^x\ 
dfT  Eo  [e^oW] 

irEo  [go  (X)  e^°W]2  E0  \g0  (X)  e^°W' 

E0  [e^°W]2  E0  [e*^°W] 

_  E0  [e^W]  E0  [gl  pQe^W]  -  E0  [<?0  ( X )  e^oW]2 
tr  Eo[e^W]2 

>  0,  (187) 

when  tT  >  0,  where  the  results  of  equation  (178)  are  used  restricting  g0  ( x )  so 
that  the  strict  inequality  applies. 

The  relationship  between  tT  and  r  is  described  functionally  by  equation  (167). 
Originally,  tr  was  described  as  the  value  of  t  >  0  where  the  moment  generating 
function  of  gT  (A), 

Mr(t)  =  E0  [et(9oW-T)] 

=  e"TtE0  [etSoW] , 


(188) 
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equals  one.  The  fact  that  tT  >  0  is  assured,  as  seen  by  applying  Jensen’s  inequal¬ 
ity,  because  the  mean  of  the  biased  detector  non-linearity  is  negative,  (160).  For 
a  fixed  value  of  t  >  0,  say  t*,  Mr  (£*)  is  a  strictly  decreasing  function  of  r,  as  seen 
by  the  partial  derivative  of  equation  (188), 


§-TMAn  = 


— t*e-t*TEn 


J*go(X) 


<  0 


(189) 


for  all  r.  This  implies  that  there  can  exist  at  most  one  value  of  r,  say  r*,  such 
that  Mt»  (r)  =  1.  That  this  value  exists  is  insured  by  the  continuity  of  Mr  (t*) 
in  r  and  the  fact  that 

lim 

Mr  (i*)  =  0  (190) 

T  00 

and 

lim 

Mr  (f )  =  oo  (191) 

r  — ►  —oo 

for  t*  >  0.  This,  however,  does  not  insure  that  t*  is  positive. 


4.3.3  Example 

To  illustrate  the  use  of  equations  (170),  (181),  and  (167)  to  determine  the 
moment  generating  function  root  and  the  associated  optimal  bias,  the  GLR 
block  level  test  statistic  is  evaluated  using  the  GLR  locally  optimal  detector 
non-linearity  of  equation  (147).  Curves  relating  the  optimal  bias  to  the  desired 
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efficacy  are  shown  in  figure  4  for  the  adaptive  dimension,  N  =  1,  and  various 
block  sizes.  The  SIR  required  to  achieve  a  specified  asymptotic  efficacy  is  shown 
in  figure  5  as  a  function  of  the  applied  bias.  Evaluation  of  equations  (170)  and 
(167)  results  in  the  optimal  choice  of  bias  which  is  seen  to  require  the  minimum 
SIR.  The  efficacy  achieved  for  various  SIR  values  as  a  function  of  the  applied 
bias  is  shown  in  figure  6  where  it  is  seen  that  the  optimal  choice  of  bias  results 
in  the  maximum  efficacy. 


Figure  4:  Optimal  bias  as  a  function  of  the  asymptotic  efficacy  for  N  =  1  and 
block  sizes  M  —  5,  10,  and  15  for  the  GLR  block  level  statistic  with  the  locally 
optimal  non-linearity. 


GLR  Statistic  /  GLR-LO  Nonlinearity 
*  -  Optimal  choice  of  detector  bias 


Detector  Bias 


Figure  5:  SIR  required  to  achieve  a  specified  efficacy  as  a  function  of  the  applied 
bias  for  N  =  1  and  M  =  10  for  the  GLR  block  level  statistic  with  the  locally 
optimal  non-linearity. 


Figure  6:  Efficacy  achieved  for  specified  values  of  signal-to-interference  ratio  as 
a  function  of  the  applied  bias  for  N  =  1  and  M  =  10  for  the  GLR  block  level 
statistic  with  the  locally  optimal  non-linearity. 


Chapter  5 


Performance  Analysis  of  Block  Detection 


In  chapter  3,  the  GLR  and  AMF  block  level  statistics  were  derived.  In  chap¬ 
ter  4,  the  log-likelihood  ratio  detector  non-linearity  was  described  and  the  lo¬ 
cally  optimal  detector  non-linearity  was  derived  for  the  GLR  and  AMF  block 
level  statistics.  In  this  chapter  the  combinations  of  the  GLR  and  AMF  block 
level  statistics  and  the  design  SIR  and  locally  optimal  detector  non-linearities 
are  compared  in  terms  of  the  asymptotic  efficacy  achieved  for  various  signal-to- 
interference  ratios.  The  best  of  the  four  combinations  will  then  be  examined  as 
a  function  of  the  block  size  and  the  adaptive  dimension  which  is  the  number  of 
elements  in  each  data  vector. 

5.1  Preliminaries 

Adaptive  algorithms,  including  the  one  presented  in  this  dissertation,  often 
require  the  estimation  and  inversion  of  covariance  matrices.  When  the  dimension 
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of  the  data  is  large,  for  instance  an  array  with  a  large  number  of  sensors,  ade¬ 
quate  estimation  of  the  covariance  matrix  requires  a  large  number  of  samples  and 
the  inversion  of  the  covariance  matrix  estimate  becomes  numerically  inhibitive 
as  well  as  potentially  unstable.  Matrix  preprocessing  of  the  array  data  to  a 
smaller  dimension  reduces  numerical  computations  often  with  near  equivalent 
average  performance  as  shown  by  Owsley  and  Abraham  [29]  and  also  provides 
improved  statistical  performance  as  shown  by  Burgess  and  Van  Veen  [30].  Matrix 
preprocessing  entails  a  linear  transformation  of  P  dimensional  sensor  data  to  N 
dimensional  data  using  a  constant  N-by-P  matrix.  Gray’s  [31]  beamspace  prepro¬ 
cessing  algorithm  is  of  particular  interest  because  the  adaptive  dimension  can  be 
dramatically  reduced  with  minimal  loss  in  performance.  The  performance  may  be 
quantified  by  the  array  gain  improvement  (AGI),  as  defined  by  Owsley  [32],  of  the 
preprocessor  as  a  function  of  the  adaptive  dimension.  The  signal-to-interference 
ratio  for  the  preprocessed  data  is  the  array  gain  improvement  multiplied  by  the 
SIR  for  the  conventional  processing  case,  which,  for  beam  space  preprocessing, 
has  the  form 


SIR  (N)  =  AGI  (N)  SIR  (N  =  1) .  (192) 

As  the  adaptive  dimension,  N,  is  increased  towards  the  total  number  of  sensors, 
the  AGI  increases.  In  certain  types  of  interference  environments,  in  particular 
those  with  strong  plane  wave  interferers,  the  array  gain  improvement  can  be  quite 
large,  for  instance,  on  the  order  of  ten  decibels. 
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The  efficacy,  as  described  in  chapter  2,  is  a  function  of  the  average  number 
of  samples  between  false  alarms  and  the  average  delay  before  detection.  By 
implementing  Page’s  test  on  blocks  of  data,  these  parameters  are  defined  at  the 
block  level.  However,  the  detection  algorithm  must  be  designed  based  on  the 
performance  at  the  sample  level.  If  M  is  the  block  size,  if  t]b  is  the  asymptotic 
efficacy  at  the  block  level,  and  if  TB  and  DB  are  respectively  the  average  number 
of  blocks  between  false  alarms  and  before  detection,  the  sample  level  asymptotic 
efficacy  may  be  described  as 

T]  = 


log  (MTb) 

MDb 

log  {Tb)  log  (M) 
MDb  MDb 
Vb_  log  (M) 

M  MDb  ' 


(193) 


Varying  DB  results  in  upper  and  lower  bounds  on  the  sample  level  asymptotic 
efficacy  in  terms  of  the  block  size  and  block  level  efficacy 

I*  <  <22  log_(A£) 

M  1  ~  M  M  K  } 


Equation  (194)  may  be  solved  for  the  block  level  efficacy  yielding  the  form 


Mr]  -  log  (M)  <r)B  <  Mr]. 


(195) 


The  following  performance  analysis  describes  the  performance  of  the  block  de¬ 
tection  algorithm  in  terms  of  either  the  signal-to-interference  ratio  required  to 
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achieve  a  specified  efficacy  or  one  of  the  sample  level  efficacy  bounds  of  equa¬ 
tion  (194)  achieved  by  a  specified  SIR.  Evaluation  of  the  efficacy  requires  knowl¬ 
edge  of  the  mean  of  the  detector  non-linearity  as  a  function  of  the  signal-to- 
interference  ratio  and  the  moment  generating  function  root  of  the  detector  non¬ 
linearity.  Both  of  these  require  the  probability  density  function  of  the  block 
level  statistic.  Numerical  integration  is  used  to  form  tables  of  the  PDF  of  the 
block  level  statistics  when  no  closed  form  exists.  Similarly,  tables  are  formed  for 
the  design  SIR  non-linearities  and  the  locally  optimal  non-linearity  for  the  AMF 
statistic. 

5.2  Performance  Analysis 

The  first  objective  is  to  evaluate  the  performance  of  all  the  combinations 
of  the  block  level  statistics  and  the  detector  non-linearities.  Figures  7  and  8 
respectively  contain  curves  of  the  lower  bound  on  the  sample  level  efficacy  for 
the  design  SIR  and  locally  optimal  non-linearities  comparing  the  GLR  and  AMF 
statistics  as  a  function  of  the  signal-to-interference  ratio  for  the  block  size  M  —  15 
and  the  adaptive  dimensions  N  =  1,  5,  and  10.  The  design  SIR  non-linearities 
were  chosen  to  be  optimal  for  a  signal-to-interference  ratio  of  s  =  0  dB  and  the 
locally  optimal  non-linearities  were  implemented  with  a  bias  chosen  to  maximize 
the  efficacy  for  s  =  0  dB.  Note  that  for  the  N  =  1  case,  the  GLR  and  AMF 
block  level  statistics  are  identical.  It  is  clear  that  for  both  non-linearities  the 
GLR  statistic  outperforms  the  AMF  statistic. 
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Figures  9  and  10  respectively  contain  curves  of  the  lower  bound  on  the  sample 
level  the  efficacy  for  the  AMF  and  GLR  statistics  comparing  the  design  SIR  and 
locally  optimal  non-linearities  as  a  function  of  the  signal-to-interference  ratio  for 
the  block  size  M  —  15  and  the  adaptive  dimensions  N  =  1,  5,  and  10.  Both 
figures  reflect  equivalent  performance  at  s  =  0  dB,  a  result  of  the  non-linearities 
being  optimized  by  design  or  bias  for  this  level.  The  locally  optimal  non-linearity 
performs  worse  than  the  design  SIR  non-linearity  for  the  AMF  statistic  and  yet 
performs  slightly  better  than  the  design  SIR  non-linearity  for  the  GLR  statistic. 
The  degradation  increases  as  the  SIR  is  displaced  from  the  design  value,  especially 
for  smaller  values  of  SIR.  The  difference  in  performance  at  displaced  SIR  values 
decreases  as  the  adaptive  dimension  increases  for  the  GLR  statistic. 

These  comparisons  indicate  that  the  locally  optimal  detector  non-linearity 
operating  on  the  GLR  statistic  provides  the  best  performance  while  the  locally 
optimal  detector  non-linearity  operating  on  the  AMF  statistic  provides  the  worst 
performance.  This  is  an  appealing  result  because  the  GLR  statistic  has  a  simpler 
conditional  probability  density  function,  particularly  when  no  signal  is  present, 
as  the  PDF  is  central  chi-squared.  Additionally,  the  locally  optimal  non-linearity 
for  the  GLR  statistic  has  a  closed  form  representation  where  the  design  SIR  non- 
linearities  and  the  locally  optimal  non-linearity  for  the  AMF  statistic  require  a 
quantized  implementation.  All  further  analysis  will  be  performed  for  the  GLR 
statistic  with  its  locally  optimal  non-linearity. 
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It  is  now  desired  to  analyze  the  performance  of  the  detector  as  a  function 
of  the  adaptive  dimension.  Figure  11  contains  curves  of  the  SIR  required  to 
achieve  a  particular  value  of  efficacy  as  a  function  of  the  adaptive  dimension. 
The  SIR  is  chosen  so  that  the  block  level  efficacy  meets  the  upper  bound  of 
equation  (195).  The  bias  chosen  to  implement  the  detector  minimizes  the  SIR 
required  to  achieve  the  desired  efficacy.  Efficacies  ranging  from  0.1  to  0.9  are 
shown  for  a  block  size  of  M  =  15  and  adaptive  dimensions  varying  from  N  =  1 
to  N  =  13.  The  utility  of  these  curves  is  shown  by  the  dashed  curve  superimposed 
on  the  constant  efficacy  contours.  This  curve  is  the  signal-to-interference  ratio 
of  equation  (192)  for  beamspace  preprocessed  array  data  as  a  function  of  the 
adaptive  dimension  of  the  preprocessor  for  a  specific  interference  scenario  and 
beamforming  look  direction.  Varying  the  SIR  at  the  output  of  a  conventional 
beamformer,  SIR(N  =  1),  simply  shifts  the  dashed  curve  up  or  down  which 
in  turn  indicates  the  adaptive  dimension  that  provides  the  best  performance. 
These  curves  may  be  used  to  determine  the  preprocessor  type  and  dimension  that 
provide  the  best  performance  in  a  specific  interference  scenario.  The  scenario  for 
this  example  consisted  of  a  thirty-two  sensor,  half  wavelength,  equally  spaced 
line  array  beamformed  broadside  to  the  array.  The  interferences  consisted  of 
plane  waves  at  —2,  2,  and  5  degrees  from  broadside  respectively  at  the  levels  13, 
10,  and  10  dB  above  spatially  uncorrelated  noise  and  a  Butterworth  angularly 
extended  interfering  signal  at  4  degrees  and  10  dB  with  a  spreading  coefficient 
of  p  —  0.5.  If  the  interference  scenario  was  restricted  to  spatially  uncorrelated 


71 


noise,  the  SIR  curve  would  be  constant  as  the  adaptive  dimension  increased  which 
would  indicate  that  conventional  beamforming  should  be  used  to  maximize  the 
efficacy.  This  result  is  expected  as  the  conventional  beamformer  maximizes  the 
beam  output  SIR  for  spatially  uncorrelated  background  noise. 

In  the  previous  analyses,  the  block  size  has  been  held  fixed  at  M  =  15.  The 
block  size  may  be  limited  from  above  by  the  degree  of  the  non-stationarity  of 
the  interfering  signals  as  well  as  the  loss  in  time  resolution  and  is  bounded  below 
by  M  >  N  +  1.  The  signal-to-interference  ratios  required  to  achieve  the  upper 
and  lower  bounds  on  the  block  level  efficacy  of  equation  (195)  for  an  asymptotic 
efficacy  of  77  =  0.5  for  N  =  1,  5,  and  10  are  shown  in  figure  12  as  a  function  of 
the  block  size.  The  lower  bound  curve  for  the  N  =  1  case  clearly  indicates  that  a 
block  size  of  M  —  5  yields  the  minimum  required  SIR  to  achieve  77  =  0.5.  These 
curves  may  be  used  to  choose  a  block  size  that  provides  adequate  performance 
without  violating  stationarity  assumptions  or  substantial  loss  in  time  resolution. 


SIR  (dB) 


Figure  7:  Efficacy  curves  for  the  design  SIR  non-linearity  for  the  GLR  and  AMF 
statistics  for  M  =  15,  N  =  1,  5,  and  10. 


SIR  (dB) 

Figure  8:  Efficacy  curves  for  the  locally  optimal  non-linearity  for  the  GLR  and 
AMF  statistics  for  M  =  15,  N  =  1,  5,  and  10. 


Figure  9:  Efficacy  curves  for  the  AMF  statistic  for  the  design  SIR  and  locally 
optimal  non-linearities  for  M  =  15,  N  =  1,  5,  and  10. 
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Adaptive  Dimension 

Figure  11:  SIR  required  to  achieve  constant  efficacy  as  a  function  of  the  adaptive 
dimension  for  the  GLR  locally  optimal  detector  with  a  block  size  of  M  =  15. 
The  dashed  line  represents  the  SIR  for  a  broadside  look  direction  for  a  specific 
interference  scenario  with  beamspace  preprocessing  of  a  sixty-four  sensor,  half 
wavelength,  equally  spaced  line  array  where  the  conventional  beam  output  SIR 
is  SIR  (N  =  l)  =  -3  dB. 


Block  Size  -  M 


Figure  12:  Upper  and  lower  bounds  on  the  SIR  required  to  achieve  a  sample  level 
efficacy  of  rj  =  0.5  as  a  function  of  block  size  for  N  =  1,  5,  and  10  for  the  GLR 
locally  optimal  detector. 


Chapter  6 


Post  Block  Processing 


Once  Page’s  test,  implemented  using  the  detector  non-linearities  described  in 
chapter  4,  has  indicated  that  a  signal  has  been  detected,  further  processing  of 
the  data  may  provide  improved  time  resolution  of  the  estimate  of  the  onset  time. 
In  speech  processing  or  classification  applications,  this  will  provide  improved 
segmentation  of  short  time  duration  signals.  Page’s  test  operates  on  the  block 
level  data,  providing  a  coarse  estimate  of  the  onset  time  of  the  signal.  Assuming 
that  the  interference  is  stationary  over  a  length  of  time  that  straddles  the  signal 
onset  time,  a  maximum  likelihood  estimate  of  the  signal  onset  time  may  be  found 
for  the  deterministic  signal  model  described  in  section  3.1.1. 
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6.1  Derivation 
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where 

X  -  Xi  x2  •••  Xm  ’ 

(199) 

M 

S  = 

(200) 

.  M-P  +  l 

Pp  =  M  ’ 

(201) 

i  M 

xp  M-P  +  l5Xl’ 

(202) 

and 

- 

BP(0)  =  S  -  /3P0dx£  -  /3PrxPdH  +  0P00*dd" 

(203) 

The  maximum  likelihood  estimate  for  the  unknown  start  time  requires  the  max- 

imization  of  equation  (198)  with  respect  to  £,  6,  and  P.  It  is  recognized, 

as  seen 

in  section  3.1.1,  that  the  maximization  over  £  requires  choosing 

£  =  BP  (0). 

(204) 

The  resulting  likelihood  function  is 

r  a-N  iM 

/(X|£,0,P)  [^|Bp(^)|J  ’ 

(205) 

which  may  be  equivalently  maximized  over  6  by  minimizing  the  determinant  of 

Bp  (0).  The  closed  form  of  the  determinant  is  seen  to  be 

|BP  (0)  j  =  S  -  pP0 dxf  -  /3p9*x.pdH  +  pP00* ddH 

- 

=  |Sf  IN  +  ppS-'DED11  , 

(206) 

where 

D  —  #d  xp  > 

(207) 
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Using  the  determinant  identity 

| In  +  AB|  =  |Im  +  BA| , 


(208) 


(209) 


where  A  is  n-by-m  and  B  is  ra-by-n,  as  found  in  Kailath  [33],  results  in 

|BP(0)|  =  |S|  |ljv  +  ftp  (S_1DE)  T>h 

=  |S|  |l2  +  /?pDHS_1DE|  (210) 

dn  di2  1  -1 

I2  +  Pp 

d,2\  d,22  ~  1  0 

1  +  0p  (dn  —  di2)  —  Ppdn 

=  |S| 

f3p  (d2i  —  d22)  1  —  Ppd 21 

—  |S|  (1  + /?p  (dn  —  di2))  (1  — /?pd2i)  + /?pdn  (cfei  —  d22) 

=  ] S |  1  +  /3pdn  (1  —  0pd22)  —  Pp  (d2i  +  di2)  +  /?pd2idi2  ,  (211) 


where  I2  is  the  two  dimensional  identity  matrix  and 

dn  di2 

d21  d22 

Substituting  equation  (207)  into  (212)  results  in 


dhs-1d  = 


dll  =  0rdHS-xd  (213) 

di2  =  0*dHS_1xp  (214) 
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g?2i  =  0XpS  1d 


(215) 


d22  =  XpS  xxp, 


(216) 


which,  when  substituted  into  equation  (211)  and  completing  the  complex  square 
for  9 ,  results  in 


Bp  (0)|  =  |S|  [l  +  fa  (G99*  -  0x^S_1d  -  0*d* S_1xp) 


=  ISI 


=  ISI 


1  —  Pp 


1  —  ftp 


dHS-^p 


G 


dHS -xxp 


G 


+  /?pG 


+  PpG 


00*  _  ^s-ld 


-g*d^S-1xP  + 


G 

dHS-1x  I2 


dHS-1xP 


G 


yj 


(217) 


where 


G  =  daS_1d  (l  -  0Pi$ S-'xp)  +  *  d"S-'xp 


(218) 


If  G  >  0,  equation  (217)  is  minimized  by  setting 


6  = 


dHS~] 


xP 


(219) 


which  results  in 


f(x\t,e,p)  = 


(ire) 


-N 


M 


(ire) 


-NM 


|S| 


M 


1  + 


Pp 

dHS~1xp\ 

2 

dFS-1d  (1  —  /3PXpl 

S_1xp) 

n  M 


(220) 


The  maximum  likelihood  estimate  of  P  is  found  by  choosing  the  value  that  max¬ 
imizes  equation  (220).  This  requires  a  search  over  values  of  P  that  do  not  violate 
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the  G  >  0  condition  nor  the  requirement  that  the  estimate  of  the  interference 
covariance  matrix,  XJ,  be  positive  definite.  Considering  the  first  requirement, 


0 

0 

1 

P 


<  G 

<  dHS-M(i  -pPx 

( 

<  Pp 

XpS  xxp  — 

/ 

>  M  | 

1 - - - 

^  (d^S-M 

dHS-1xp 


l2\ 


d^S^d 
dHS-1d 


+ T 


(221) 


it  is  seen  that  a  lower  bound  is  placed  on  P.  The  bound  itself  is  a  function  of  P, 
thus,  it  must  be  evaluated  for  all  potential  start  times  from  1  to  M. 

The  second  requirement  was  that  the  estimate  of  the  interference  covariance 
matrix,  BP  ( 9 ),  be  positive  definite.  This  is  shown  by  first  proving  that  BP  (#) 
is  positive  semi-definite  for  all  6  using  a  quadratic  form  and  then  by  requiring 
the  determinant  of  BP  (d'j  to  be  greater  than  zero.  By  placing  the  block  data 
columnwise  into  the  N-by-M  matrix  X,  the  matrix  S  and  the  vector  xP  may  be 
written  as 


S  = 

M 


(222) 


and 


1 

M- P  +  1 


XeF, 


xP  = 


(223) 
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where  ep  represents  an  M- by-1  dimensional  vector  with  zeros  in  the  first  P  —  1 
elements  and  ones  elsewhere, 

r 

0 


ep  = 


The  matrix  Bp  (9)  may  now  be  written  in  the  form 


(224) 


BP  (9)  =  S  -  /3P5cp5cp  +  (5P 


xP  -9d 


1  H 


xp  —  6d 


J-XX^ _ 

M  (M-P  + 1)2' 


Xe^epX^  +  /3pVV 


H 


M' 


I M  ~ 


epef 


M-P+l 


xH  +  pPwIi, 


H 


(225) 


where  1m  is  the  M  dimensional  identity  matrix  and 


V  = 


xp  —  9d 


(226) 


Applying  this  form  of  Bp  {6)  to  a  quadratic  form  in  y  results  in 


yHBP  (9) y  =  —yHXUXHy  +  j3P yHWHy, 


(227) 


where 


U  = 


I M  ~ 


epe.p 


M-P+l 


(228) 
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As  the  latter  term  in  equation  (227)  is  clearly  the  inner  product  of  the  vector 
VHy,  it  must  be  greater  than  or  equal  to  zero,  where  the  equality  may  occur 
because  V  is  at  most  rank  2.  The  matrix  X  has  rank  equal  to  N  because  its 
product,  S  =  jjXXH,  has  rank  equal  to  N.  The  matrix  U  is  recognized  as  a 

projection  matrix  with  rank  equal  to  M  -  1.  Thus,  any  quadratic  form  involving 

the  matrix  U  must  lie  between  zero  and  one, 

0  <  zHXJz  <  1.  (229) 

The  first  term  of  equation  (227)  is  a  quadratic  form  of  the  matrix  U  and  the  M 
dimensional  vector  XHy.  Thus,  the  quadratic  form 

yHBP  (9)y  >  0,  (230) 


and  the  matrix  BP  (0)  is  positive  semi-definite  for  all  6. 

The  necessary  requirement  that  the  determinant  of  BP  (jfj  be  positive  yields 
the  constraint, 
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which  places  a  tighter  bound  on  P  than  equation  (221).  The  resulting  maximum 
likelihood  estimator  of  P  may  be  written  as 


P  =  arg  max 


{M-P+  1)  d-HS~1xp 


pc4>  M  -  (M  -  P  +  1)  xf  S-ixp ; 


(232) 


where 


<f>  = 


1  <  P  <  M  and  P  >  M 


XpS_1xp 


(233) 


6.2  Performance  Analysis 


The  post  block  processing  algorithm  of  equations  (232)  and  (233)  is  examined 
in  this  section  for  performance  as  a  function  of  the  true  start  time,  the  signal-to- 
interference  ratio,  and  the  adaptive  dimension.  Two-hundred  trials  were  run  for 
the  adaptive  dimensions  N  =  1,5,  and  10,  and  for  block  sizes  M  =  10,  15,  20,  25, 
and  30,  excepting  the  combination  N  =  10  and  M  =  10.  Signal-to-interference 
ratios  ranging  from  s  =  —  5  dB  to  10  dB  in  one  decibel  increments  were  considered. 
For  each  case,  the  estimated  start  time  was  computed  for  each  possible  true  start 
time.  The  results  for  the  cases  were  very  similar,  exhibiting  expected  trends. 
Thus,  the  N  =  5,  M  =  20  case  is  used  to  present  these  trends.  Histograms  for 
several  true  start  times  for  the  s  =  5  dB  case  are  found  in  figure  13.  Note  that 
when  the  signal  starts  in  the  latter  portion  of  the  block  (the  upper  histogram), 
there  is  a  skewness  to  the  left  that  is  larger  than  the  corresponding  skewness  to 
the  right  displayed  by  the  histogram  of  the  signal  that  starts  in  the  initial  portion 
of  the  block  (the  lower  histogram).  This  may  be  explained  by  there  being  very 
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few  samples  with  signal  present  when  the  signal  starts  in  the  latter  portion  of  the 
block,  causing  a  more  difficult  estimation  problem.  Figure  14  contains  histograms 
for  a  fixed  true  start  time  for  signal-to-interference  ratios  varying  from  -5  to  10 
dB  in  three  decibel  increments.  The  mean  squared  error  is  estimated  from  the 
histograms  and  shown  in  figure  15  as  a  function  of  the  true  start  time  for  SIR 
values  of  -5,  0,  5,  and  10  dB.  The  asymmetry  of  the  error  noted  in  the  histograms 
is  also  seen  here  where  the  mean  squared  error  is  largest  for  signals  that  start  in 
the  latter  quarter  of  the  block.  Weak  signals  incur  larger  errors  when  they  start 
near  either  boundary,  with  the  best  performance  somewhere  near  the  middle  of 
the  block.  The  mean  squared  error  is  averaged  over  all  possible  true  start  times 
and  plotted  against  the  signal-to-interference  ratio  for  the  adaptive  dimensions 
jV  =  1,  5,  and  10  in  figure  16.  Here  it  is  seen  that  there  is  minimal  loss  between 
the  N  =  1  case  and  the  IV  =  5  case  indicating  that  the  performance  losses 
associated  with  processing  multivariate  data  do  not  accrue  rapidly. 

6.3  Invariance 

In  considering  equations  (232)  and  (233),  it  is  not  clear  if  the  post  block 
maximum  likelihood  estimate  of  the  signal  onset  time  is  invariant  to  the  inter¬ 
ference  covariance  matrix.  By  applying  the  whitening  transformation  of  section 
3.1.2  described  by  equations  (43)-(55)  it  may  be  shown  that  the  probability  den¬ 
sity  function  of  the  forms  that  describe  the  maximum  likelihood  estimator  of  the 
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signal  onset  time, 


d^S  xxp 


(234) 


and 

XpS-1xp,  (235) 

depend  on  the  interference  covariance  matrix  only  through  the  parameter  (3  de¬ 
scribed  by  equations  (47)  and  (48).  The  parameter  (3  may  be  thought  of  as 
a  whitened  complex  strength  parameter  with  magnitude  equal  to  the  signal-to- 
interference  ratio.  The  dependence  of  the  post  block  processing  performance  on 
the  SIR  is  acceptable.  The  effect  of  the  dependence  on  the  whitened  phase  of  the 
signal  is  not  clear,  however,  it  is  much  better  than  dependence  on  an  N  dimen¬ 
sional  covariance  matrix.  The  simulations  of  section  6.2  were  conducted  with  an 
identity  matrix  for  an  interference  covariance  matrix  and  for  (3  —  a/s. 


89 


Estimated  Start  Time 


Figure  13:  Histograms  of  the  onset  time  estimate  for  several  true  start  times  for 
an  SIR  of  5  dB  for  IV  =  5  and  M  =  20.  The  asterisk  represents  the  true  start 
time.  Each  histogram  has  been  scaled  by  the  same  value  to  facilitate  comparison. 
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Figure  14:  Histograms  of  the  onset  time  estimate  for  several  SIR  values  for  N  =  5 
and  M  —  20.  The  asterisk  represents  the  true  start  time.  Each  histogram  has 
been  scaled  by  the  same  value  to  facilitate  comparison. 
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Figure  15:  Mean  squared  error  of  the  onset  time  estimate  for  several  SIR  values 
as  a  function  of  the  true  start  time  for  N  =  5  and  M  =  20. 


Figure  16:  Mean  squared  error  of  the  onset  time  estimate  averaged  over  all  true 
start  times  for  N  =  1,  5,  and  10  for  M  =  20  as  a  function  of  the  signal-to- 
interference  ratio. 


Chapter  7 


Block-Post  Block  Simulation 


The  block  detection  algorithm  proposed  and  analyzed  in  chapters  3  -  5  and 
the  post  block  processing  to  improve  the  signal  onset  time  estimate  of  chapter  6 
are  applied  to  simulated  array  data  for  a  stationary  and  a  non-stationary  inter¬ 
ference  scenario.  The  array  and  signal  processing  consist  of  a  half  wavelength, 
equally  spaced,  thirty-two  sensor  line  array  with  frequency  domain  plane  wave 
beamforming  to  a  broadside  look  direction.  As  suggested  in  chapter  5,  beamspace 
preprocessing  is  used  to  reduce  the  adaptive  dimension  to  AT  =  1  providing  con¬ 
ventional  beamforming,  and  to  N  =  5  yielding  a  potential  improvement  due  to 
adaptive  processing  if  warranted  by  the  interference  scenario.  This  simulation 
study  will  be  conducted  in  two  parts:  first,  the  block  detection  performance  will 
be  evaluated  by  examining  histograms  of  the  time  between  false  alarms  and  the 
time  before  detection  as  well  as  the  average  time  before  detection  versus  sig¬ 
nal  strength;  second,  the  combined  block  detection-post  block  processing  will  be 
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evaluated  by  histograms  of  the  estimated  start  time  and  by  considering  the  mean 
squared  error  as  a  function  of  the  signal  strength. 

7.1  Interference  Scenario 

The  stationary  interference  scenario  consists  of  the  following:  plane  wave 
interferences  at  2  degrees  and  10  dB,  5  degrees  and  10  dB,  and  —2  degrees  and 
13  dB,  a  Butterworth  angularly  spread  interference  at  4  degrees  and  10  dB  with 
a  spread  parameter  of  p  =  0.5,  and  spatially  uncorrelated  noise  at  0  dB.  All 
angles  are  from  broadside  to  the  array.  A  scale  is  applied  to  this  interference 
scenario  so  that  the  conventional  beam  output  interference  power  is  0  dB  at 
broadside  to  the  array.  Thus,  evaluating  signal  strengths  ranging  from  —5  to 
5  dB  is  effectively  considering  signal-to-interference  ratios  ranging  from  —5  to 
5  dB  for  a  conventional  beamformer  (N  =  1).  The  non-stationary  interference 
scenario  begins  with  the  stationary  scenario  and  has  linear  changes  in  the  angular 
position,  strength  in  decibels,  and  spread  of  the  interference  signals  as  found  in 
table  1.  The  Butterworth  angularly  spread  interference  results  in  a  plane  wave 
when  p  =  1  and  in  spatially  uncorrelated  noise  when  p  =  0.  The  interference 
scenario  is  periodic  with  period  equal  to  3000  samples.  Over  one  period,  the  beam 
output  interference  power  for  conventional  beamforming  ( N  —  1),  for  beamspace 
preprocessing  to  N  =  5  beams,  and  for  fully  adaptive  beamforming  as  a  function 
of  time  may  be  found  in  figure  17.  The  difference  between  the  conventional  beam 
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output  interference  power  and  either  adaptive  beam  output  interference  power 
(in  dB)  is  the  array  gain  improvement  for  the  respective  adaptive  beamformer. 


Signal 

Time  Index 

Angle  (deg) 

Strength  (dB) 

Spread 

1 

1 

-2° 

13 

1.0 

1000 

-1° 

10 

1.0 

2000 

-3° 

15 

1.0 

3000 

-2° 

13 

1.0 

2 

1 

2° 

10 

1.0 

500 

3° 

5 

1.0 

2000 

2° 

13 

1.0 

3000 

2° 

10 

1.0 

3 

1 

4° 

10 

0.5 

■ESilNi 

8° 

15 

0.3 

3000 

4° 

10 

0.5 

4 

1 

5° 

10 

1.0 

■H!1H 

3° 

10 

1.0 

3000 

5° 

10 

1.0 

5 

1 

n/a 

0 

0.0 

3000 

n/a 

0 

0.0 

Table  1:  Non-stationary  interference  scenario.  The  interference  parameters 
change  linearly  between  the  specified  time  indices. 


7.2  Block  Detection  Performance 

The  GLR  locally  optimal  detector  is  implemented  with  an  average  time  be¬ 
tween  false  alarms  of  T  =  20  blocks  and  a  block  size  of  M  =  10  samples.  The 
thresholds  were  set  by  using  equation  8  of  chapter  2.  Histograms  of  the  time 
between  false  alarms  for  the  stationary  and  non-stationary  interference  scenarios 
are  found  respectively  in  figures  18  and  19.  The  observed  average  time  between 
false  alarms  was  approximately  25  times  the  design  value  for  N  =  1  and  10  times 


Beam  Output  Interference  Power  (dB) 
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Figure  17:  Beam  output  interference  power  for  conventional  beamforming  (N  = 
1),  for  beamspace  preprocessing  to  N  =  5  beams,  and  for  fully  adaptive  beam¬ 
forming  for  the  non-stationary  interference  scenario. 
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the  design  value  for  N  =  5.  As  the  design  value  is  only  a  lower  bound,  the 
difference  is  acceptable,  however,  tighter  bounds  would  provide  more  accurate 
threshold  computation.  As  expected,  the  time  between  false  alarms  was  slightly 
smaller  for  the  non-stationary  interference  due  to  the  lack  of  stationarity  within 
each  block.  The  average  times  between  false  alarms  are  still  substantially  larger 
than  the  design  value  although  this  may  be  affected  by  increasing  the  block  size. 

Histograms  of  the  number  of  blocks  before  detection  for  the  stationary  and 
non-stationary  interference  scenarios  are  found  respectively  in  figures  20  and  21 
for  the  N  =  1  and  N  =  5  cases  for  several  signal  strengths.  There  seems  to  be 
minimal  difference  between  the  stationary  and  non-stationary  interference  sce¬ 
narios.  As  seen  in  figure  17  the  beam  output  interference  power  only  changes 
about  0.75  dB  during  the  first  300  time  samples  (30  blocks)  for  the  N  =  1  and 
N  =  5  case,  which  may  explain  the  minimal  difference.  The  average  number  of 
blocks  before  detection,  as  a  function  of  the  signal  strength,  is  found  in  figure  22. 
Again,  the  minimal  difference  between  the  stationary  and  non-stationary  interfer¬ 
ence  scenarios  is  observed  where  the  non-stationary  case  seems  to  have  a  slightly 
smaller  average  time  before  detection.  This  may  be  accounted  for  by  the  increase 
in  the  variance  of  the  block  level  statistics  due  to  the  non-stationarity  of  the 
interference  over  one  block  as  in  the  smaller  time  between  false  alarms.  The  pre¬ 
dicted  performance,  calculated  by  using  the  approximation  of  equation  10  found 
in  chapter  2,  is  shown  in  the  figure  as  well,  where  it  is  seen  that  the  predicted 
values  are  no  more  than  one  block  below  the  observed  values.  The  advantage  in 
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preprocessing  the  array  data  to  a  specific  dimension  is  evident  in  that  for  signal 
strengths  below  0.5  dB,  the  N  =  5  case  results  in  better  detection  performance 
while  for  larger  signal  strengths,  the  N  =  1  case  slightly  outperforms  the  N  =  5 
case.  This  may  also  be  observed  in  the  required  SIR  curves  in  figure  11  of  chap¬ 
ter  5  where  the  best  preprocessor  dimension  may  change  with  signal  strength. 

7.3  Block-Post  Block  Performance 

A  block  of  data  thirty  samples  long  immediately  previous  to  the  block  detec¬ 
tion,  including  the  block  in  which  detection  occurred,  is  subjected  to  the  post 
block  processing  algorithm  of  chapter  6.  Histograms  of  the  onset  time  estimate 
of  the  combined  block-post  block  processing  algorithm  for  the  stationary  and 
non-stationary  cases  are  found  respectively  in  figures  23  and  24  for  various  sig¬ 
nal  strengths.  The  histograms  are  referenced  to  the  true  starting  time,  which 
was  a  randomly  selected  position  within  a  block.  The  weaker  signal  strength 
cases  are  clearly  biased  to  the  right,  which  is  the  result  of  a  delay  in  the  block 
detection  large  enough  so  that  the  true  start  time  occurred  outside  of  the  data 
segment  being  processed.  Minimal  difference  is  noticed  between  the  stationary 
and  non-stationary  interference  scenarios. 

The  mean  squared  error  of  the  onset  time  estimate  as  a  function  of  the  signal 
strength  for  the  stationary  and  non-stationary  interference  scenarios  and  the 
predicted  performance  are  found  in  figure  25.  The  N  =  5  case  produced  less 
mean  squared  error  than  the  N  =  1  case  as  was  expected  due  to  the  improvement 
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indicated  by  the  array  gain  improvement  of  this  interference  scenario.  The  error 
at  low  signal  strengths  is  more  than  an  order  of  magnitude  greater  than  the 
predicted  value,  particularly  for  the  N  =  1  case,  as  a  result  of  the  larger  delay 
in  detection  previously  described.  The  error  for  signal  strengths  greater  than  0 
dB  is  on  the  same  order  as  the  predicted  values.  It  is  not  expected  that  the 
error  be  exact,  as  the  predicted  value  is  the  mean  squared  error  averaged  over 
all  possible  start  times  which  does  not  exactly  represent  what  is  occurring  in  the 
simulation.  As  seen  in  the  histograms  of  the  number  of  time  samples  before  block 
detection  occurs  found  in  figure  26,  the  true  starting  time  of  the  signal  is  not  even 
approximately  uniformly  distributed  in  the  segment  of  data  that  is  applied  to  the 
post  block  processing  (the  first  thirty  samples).  Prom  this  figure  it  is  also  clear 
that  at  low  signal  strengths  there  are  a  substantial  number  of  trials  where  the 
post  block  processing  segment  does  not  include  the  true  start  time,  leading  to  an 
increased  mean  squared  error. 

Due  to  the  sequential  nature  of  the  block  detection  algorithm,  the  average 
time  before  detection  increases  as  the  signal-to-interference  ratio  decreases.  This 
has  a  very  debilitating  effect  on  the  post  block  processing  leading  to  substan¬ 
tial  errors.  Increasing  the  length  of  the  segment  of  data  submitted  to  the  post 
block  processing  may  ease  this,  however,  the  length  is  likely  to  be  limited  by  the 
stationarity  of  the  interference.  An  alternative  may  be  choosing  the  segment  of 
data  submitted  to  the  post  block  processing  through  the  use  of  an  estimate  of 
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the  signal-tointerference  ratio  over  the  block  in  which  detection  occurs  and  the 
predicted  time  before  detection. 


Figure  18:  Histograms  of  the  number  of  blocks  between  false  alarms  for  N  =  1 
and  N  =  5  with  a  block  size  of  M  =  10  for  the  stationary  interference  scenario. 
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Figure  19:  Histograms  of  the  number  of  blocks  between  false  alarms  for  N  =  1 
and  N  =  5  with  a  block  size  of  M  —  10  for  the  non-stationary  interference 
scenario. 
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Figure  20:  Histograms  of  the  number  of  blocks  before  detection  for  N  =  1  and 
N  =  5  with  a  block  size  of  M  =  10  for  various  signal  strengths  for  the  stationary- 
interference  scenario.  Each  histogram  has  been  scaled  by  the  same  value  to 
facilitate  comparison. 
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Figure  21:  Histograms  of  the  number  of  blocks  before  detection  for  N  =  1  and 
N  =  5  with  a  block  size  of  M  =  10  for  various  signal  strengths  for  the  non¬ 
stationary  interference  scenario.  Each  histogram  has  been  scaled  by  the  same 
value  to  facilitate  comparison. 
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Figure  22:  Average  number  of  blocks  before  detection  for  N  =  1  and  N  —  5  with 
a  block  size  of  M  =  10  as  a  function  of  signal  strength  for  the  stationary  and 
non-stationary  interference  scenarios  and  the  predicted  performance. 
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Figure  23:  Histograms  of  the  post  block,  maximum  likelihood  estimate  of  the 
starting  time  referenced  to  the  true  starting  time  for  N  =  1  and  N  =  5  with 
a  block  size  of  M  =  30  for  various  signal  strengths  for  the  stationary  interfer¬ 
ence  scenario.  Each  histogram  has  been  scaled  by  the  same  value  to  facilitate 
comparison. 
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Non-Stationary 


-10  3 5  5  10  15  20  25  30 


Estimated  start  time 

Figure  24:  Histograms  of  the  post  block,  maximum  likelihood  estimate  of  the 
starting  time  referenced  to  the  true  starting  time  for  N  =  1  and  N  =  5  with  a 
block  size  of  M  =  30  for  various  signal  strengths  for  the  non-stationary  interfer¬ 
ence  scenario.  Each  histogram  has  been  scaled  by  the  same  value  to  facilitate 
comparison. 
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Figure  25:  Mean  squared  error  of  the  post  block,  maximum  likelihood  estimate  of 
the  starting  time  for  N  =  1  and  N  =  5  with  a  block  size  of  M  =  30  as  a  function 
of  signal  strength  for  the  stationary  and  non-stationary  interference  scenarios  and 
the  predicted  performance. 


108 


Stationary 


20  .40  60  80  100 

Number  of  samples  before  detection 


Number  of  samples  before  detection 

Figure  26:  Histogram  of  the  number  of  time  samples  before  block  detection  occurs 
for  IV  =  1  and  N  =  5  with  a  block  size  of  M  =  10  for  various  signal  strengths  for 
the  stationary  inteference  scenario.  Each  histogram  has  been  scaled  by  the  same 
value  to  facilitate  comparison. 


Chapter  8 


Conclusion 


This  dissertation  has  considered  the  general  problem  of  the  determination 
of  the  time  that  a  change  occurs  in  the  structure  or  parameterization  of  the 
distribution  of  a  sequence  of  random  vectors  when  some  subset  of  the  parameters, 
nuisance  parameters,  are  common  to  both  the  before  change  (null)  hypothesis 
and  the  after  change  (alternative)  hypothesis.  The  proffered  solution  entails 
segmenting  the  data  into  non-overlapping  blocks,  from  which  univariate  statistics 
are  formed  that  are  invariant  to  the  nuisance  parameters  under  the  null  hypothesis 
and  may  depend  on  at  most  an  unknown  scalar  strength  parameter  under  the 
alternative  hypothesis.  The  unknown  signal  strength  parameter  was  treated  by 
applying  a  locally  optimal  non-linearity  prior  to  submission  to  Page’s  test  for 
the  detection  of  the  change.  Once  Page’s  test  has  determined  that  a  change  has 
occurred,  post  block  processing  is  performed  to  improve  the  time  resolution  of 
the  change  time  estimation.  This  structure  is  appealing  for  it  adequately  deals 


109 


110 


with  non-stationary  nuisance  parameters,  as  the  block  level  statistic  is  invariant 
to  nuisance  parameter  changes  from  block  to  block  under  the  null  hypothesis. 

A  typical  radar  or  sonar  application,  the  detection  of  the  onset  of  a  narrow- 
band  signal  of  known  frequency  with  unknown  amplitude  and  phase  arriving  at 
an  array  of  sensors,  was  considered.  The  block  detection  performance  of  two 
block  level  statistics,  a  generalized  likelihood  ratio  (GLR)  statistic  and  an  adap¬ 
tive  matched  filter  (AMF)  statistic,  applied  to  a  log-likelihood  ratio  for  a  specified 
SIR  and  to  a  locally  optimal  non-linearity,  was  determined  as  a  function  of  the 
signal-to-interference  ratio.  The  GLR  statistic  combined  with  the  locally  opti¬ 
mal  non-linearity  outperformed  the  other  statistic-non-linearity  combinations  in 
terms  of  the  asymptotic  efficacy.  Additionally,  the  GLR-locally  optimal  detector 
has  the  desirable  properties  of  a  closed  form  detector  non-linearity  and  a  stan¬ 
dard  probability  density  function,  the  central  chi-squared  distribution,  when  no 
signal  is  present.  Implementation  of  the  detectors  utilizing  the  locally  optimal 
non-linearity  required  choosing  a  bias  that  insures  that  the  mean  of  the  detector 
non-linearity  is  negative  when  no  signal  is  present.  A  method  for  choosing  the 
bias  for  a  general  non-linearity  that  minimizes  the  signal  strength  required  to 
achieve  a  specified  asymptotic  efficacy,  or  to  equivalently  maximize  the  asymp¬ 
totic  efficacy  for  a  specified  signal  strength,  was  derived.  The  SIR  required  to 
achieve  a  specific  asymptotic  efficacy  was  found  as  a  function  of  the  adaptive  di¬ 
mension  for  the  GLR-locally  optimal  detector.  Here  it  was  seen  that  increasing 
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the  adaptive  dimension  increases  the  SIR  required  to  achieve  a  specific  perfor¬ 
mance  level.  Preprocessing  the  array  data  by  a  rank  reducing,  constant,  linear 
transformation  provides  a  method  for  changing  the  adaptive  dimension  of  the 
signal  detector  algorithm.  This  also  affects  the  SIR  that  the  detector  is  subjected 
to,  as  a  function  of  the  preprocessing  structure  and  the  interference  scenario.  The 
utility  of  the  required  SIR  curves  was  shown  by  the  superposition  of  the  effective 
SIR  achieved  by  varying  the  adaptive  dimension  of  beam  space  preprocessing  of 
the  array  data  for  a  specific  interference  scenario.  The  adaptive  dimension  pro¬ 
viding  the  SIR  that  yields  the  largest  asymptotic  efficacy  was  indicated  as  the 
best  preprocessor  for  the  observed  interference  scenario. 

A  post  block  processing  algorithm,  in  the  form  of  a  maximum  likelihood 
estimate  of  the  signal  onset  time,  was  derived  for  the  deterministic  signal  model 
using  the  sample  level  data  previous  to  the  block  detection,  up  to  the  maximum 
limit  of  stationarity.  The  performance  of  the  estimator,  in  terms  of  mean  squared 
error,  was  seen  to  degrade  as  the  signal-to-interference  ratio  decreased.  Minimal 
loss  was  incurred  between  adaptive  dimensions  N  —  1  and  N  =  5,  indicating  that 
there  may  be  substantial  gain  in  preprocessing  the  array  data  to  the  dimension 
appropriate  for  the  specific  interference  scenario. 

The  combined  sequential  block  detection-post  block  processing  algorithm  was 
implemented  on  simulated  array  data  for  stationary  and  non-stationary  inter¬ 
ference  scenarios.  The  block  detection  performance  indicated  that  the  observed 


112 


average  time  between  false  alarms  was  approximately  twenty-five  times  the  pre¬ 
dicted  lower  bound  for  N  =  1  and  ten  times  the  predicted  lower  bound  for  N  =  5 
for  a  block  size  of  M  =  10.  This  indicates  the  need  for  an  improved  method  of  an¬ 
alytically  determining  the  thresholds  required  to  achieve  a  specified  time  between 
false  alarms.  The  non-stationary  interference  caused  a  slightly  quicker  detection 
along  with  an  associated  decrease  in  the  average  time  between  false  alarms  due 
to  the  non-stationarity  of  the  interference  within  each  block.  The  post  block 
processing  algorithm  was  implemented  using  the  data  in  the  block  in  which  the 
detection  occurred  and  a  finite  amount  of  previous  data.  For  weak  signals,  the 
delay  in  the  block  detection  was  often  large  enough  so  that  the  segment  of  data 
the  post  block  processing  algorithm  operated  on  did  not  contain  the  onset  of  the 
signal.  This  indicates  that  using  an  estimate  of  the  signal-to-interference  ratio 
to  set  the  block  size  or  the  particular  segment  of  data  subjected  to  post  block 
processing  may  provide  improved  estimation  in  the  combined  algorithm. 

This  dissertation  has  explored  the  use  of  a  block  sequential  adaptive  scheme 
for  the  detection  of  the  onset  of  a  narrowband  signal  at  an  array  of  sensors  where 
the  signal  amplitude  and  phase  and  the  potentially  non-stationary  interference 
covariance  matrix  are  unknown.  The  block  detection  algorithm  may  be  easily  ex¬ 
tended  to  broadband  signals  or  to  narrowband  signals  with  unknown  or  varying 
frequency  by  the  appropriate  combination  of  block  statistics  at  several  discrete 
fourier  transform  frequency  bin  outputs.  The  threshold  required  for  implementa¬ 
tion  may  require  more,  although  not  substantial,  numerical  computation.  Further 
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research  is  also  suggested  in  the  following  related  areas:  applying  the  general  al¬ 
gorithm  to  other  array  signal  models,  the  method  of  the  block  data  compression, 
more  accurate  bounds  for  the  time  between  false  alarms,  and  online  methods  of 
choosing  the  data  to  be  submitted  to  post  block  processing. 


Appendix  A 


Selected  Theorems  and  Proofs 


The  following  theorems  are  extensions  to  the  complex  case  of  theorems  com¬ 
monly  found  in  the  analysis  of  multivariate  real  Gaussian  random  variables  and 
real  Wishart  matrices.  Background  and  distributional  information  on  multivari¬ 
ate  complex  Gaussian  random  variables  and  complex  Wishart  matrices  may  be 
found  in  Goodman  [23]  and  to  a  lesser  extent  Anderson  [34].  The  original  the¬ 
orems  may  be  found  in  either  Searle  [25],  Muirhead  [24],  or  DeVroye  [35].  The 
extensions  are,  for  the  most  part,  straightforward  and  result  from  following  the 
proofs  for  the  real  cases.  When  possible,  the  proofs  have  been  simplified  or  de¬ 
ferred. 


THEOREM  A.l 


The  following  theorem  is  an  extension  of  Theorem  2,  Chapter  2,  page  57  of 
Searle  [25]  describing  the  probability  density  function  of  certain  quadratic  forms 
involving  complex  Gaussian  random  vectors. 


Theorem  A.l  If  x  ~  CHn  (m,  E)  and  A  =  AH,  then  2xH  Ax  ~ 

Xf  (6)  if  and  only  if  AE  is  idempotent.  The  non-centrality  parameter, 

6  =  2jiH Aji,  and  the  degrees  of  freedom  parameter,  r  =  2tr(AE), 
where  tr  (AE)  is  the  trace  of  AE. 

Proof:  For  the  quadratic  form 

Q  =  2xHAx,  (236) 


consider  the  moment  generating  function 


MQ(t)  =  E 
=  E 


JQ 


„2txHAx 


L"  7T"  [E|  ^ 

L  ^meVdx’ 


2txHAx  —  (x  —  n)n  E  1  (x  —  n) 


h  v>-l 


dx 


(237) 
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where  Cn  represents  n  dimensional  complex  Euclidean  space.  The  exponent,  V , 
may  be  massaged  into  a  single  quadratic  form  in  x, 

V  =  xJJ(2iA)x-xff£“1x  +  fiH£~1x  +  xH£_V-f<H£~V 

=  -  (x -£)"  S’1  (x -£)+£*£-'£ (238) 

where 

E"1  =  5T1  -  2tA  (239) 


ji  =  ee  1(jl. 

The  moment  generating  function  thus  becomes, 

exp  -  /xHE~: Vl  r  exp  [-  (x  -  p)H  E_1  (x  -  p) 

MQ  (t)  =  ljr.l-1  Ln  yj-n  ^ 


(240) 


S  IEI 


exp  [axh  (S^SS"1  -  S'1)  At] 


(242) 


|E-iE| 

exp  \-nH  (ln  -  S-1  (E"1  -  2tA)_1)  E“  V] 

=  |In-  2tAS| 

exp  \-nH  (ln  -  (In  -  2tAE)_1)  E_V]  ,0/m 

|  In  —  2fAE| 

where  I„  is  the  n  dimensional  identity  matrix. 

Suppose  that  AE  is  idempotent  with  rank  or  trace, 

tr  (AE)  =  m.  (242) 

Note  that  the  eigenvalues  of  an  idempotent  matrix  are  either  zero  or  one,  and, 
by  using  a  singular  value  decomposition,  the  matrix  AS  may  be  expressed  as 

AE  =  UV*,  (243) 

where  the  n-by-m  matrices  U  and  V  are  orthogonal, 

VHU  =  Im.  (244) 

Part  of  the  matrix  in  the  quadratic  form  in  the  exponent  of  equation  (241)  may 
now  be  simplified, 

I„  -  (I„  -  2iA£)-‘  =  I„-(l„-U(2t)Vi')_I 

=  u  -  k  -  U  (VHU  -  ^I„)  V 

=  ^-sf v" 


1 - —  AE. 

1  -  2 1) 


(245) 
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The  determinant  in  the  denominator  of  equation  (241)  is  clearly 

|In  -  2tAS|  =  (1  -  2 t)m  (246) 

because  AS  has  m  non-zero  eigenvalues,  all  equal  to  one.  Substituting  (245)  and 
(246)  into  (241)  results  in 


Mg(t) 


exp 


-  0  -  A) 


—  e 


(1  -  2 t)m 

-u*AuexP  (l4^A/i) 

(l-2t)^ 


(247) 


The  moment  generating  function  of  a  non-central  Chi-squared  random  variable 
with  r  degrees  freedom  and  non-centrality  parameter  6,  as  found  in  Muirhead 
[24],  is 


Mxh  {t) 


_6  - 

e  2  e2(l-2t) 

(1-2  ' 


(248) 


Equating  equations  (247)  and  (248),  it  is  seen  that  the  quadratic  form,  Q ,  has  a 
non-central  Chi-squared  distribution  with  r  =  2m  =  2tr  (AS)  degrees  of  freedom 
and  non-centrality  parameter  6  =  2{iH  A\x. 

If  it  is  assumed  that  the  quadratic  form  has  the  described  non-central  Chi- 
squared  distribution,  it  can  be  seen  that  the  matrix  AS  must  be  idempotent 
with  rank  equal  to  tr  (AS)  =  |  by  equating  the  denominators  of  equations  (241) 
and  (248), 


(l-2i)5  =  |In  —  2tAS| 

=  f[{l~2tXi),  (249) 

i=i 

where  A j  are  the  eigenvalues  of  AS.  Clearly  the  left  and  right  sides  of  equation 
(249)  will  be  equal  only  when  |  of  the  eigenvalues  are  equal  to  one  with  the 
remaining  equal  to  zero,  which  results  in  an  idempotent  AS  matrix. 

THEOREM  A. 2 

The  following  theorem  is  the  extension  of  Theorem  4,  Chapter  2,  page  59  of  Searle 
[25]  showing  the  independence  of  quadratic  forms  involving  complex  Gaussian 
random  vectors. 

Theorem  A.2  If  x  ~  CJ\fn(/j,,  S),  A  =  AH  and  B  =  B^,  then 
xffAx  and  xffBx  are  independent  if  and  only  if  ASB  =  BSA  =  0. 
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Proof:  The  proof  is  identical  to  that  of  Searle  [25]  and,  as  the  result  is  also 
identical,  will  not  be  repeated. 


THEOREM  A.3 

The  following  theorem  is  an  extension  of  Theorem  3.2.10  on  page  93  of  Muirhead 
[24]  describing  the  probability  density  functions  of  the  partitions  of  a  complex 
Wishart  matrix  and  of  a  particular  function  of  the  partitions  that  enjoys  certain 
independence  properties. 

Theorem  A.3  If  the  n-by-n  matrix  A  ~  CWn  (m,  £)  with  A  and  £ 
partitioned  into  two-by-two  blocks, 


A  = 

'An 

A2i 

A 12 

a22 

£  = 

'  £n 
£2i 

£12 

£22 

1 

(250) 

where  An 

and  £n  are  k-by-k, 

and  if 

An-2  =  An 

—  Ai2 

a221a 

-21 

(251) 

and 

£n.2  =  £n 

-£i2: 

^'221^'21) 

(252) 

then 

1.  An-2 

~  CWk  (m  —  n  +  k, 

£11-2) 

and  is  independent  of  A 

12  and 

A22- 

2.  The  conditional  distribution  of  A12  given  A22  is 


CJ\f kx(n—k)  (S12S221  A22,  £n- 2  ®  A22)  ■ 

3.  A22  ~  CWn_fc  (m,  £22). 

Proof:  Consider  the  probability  density  function  of  the  complex  Wishart  dis¬ 
tributed  matrix,  A,  as  found  in  Goodman  [23], 

I  A  \m~n 

f  (A)  =  J  1  etr(-S-'A),  (253) 

77i, n  I" | 

where  ‘etr’  signifies  the  matrix  exponential  trace  operation  and 

r„,„  =  (m)  ■  •  -  r(m  -  n  +  1) 


(254) 
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where  T  (x)  is  the  standard  Gamma  function.  Applying  the  k,n  —  k  factorization 
to  the  determinants  of  equation  (253)  yields 


|A|  =  |  A22I 

An  —  Ai2A221A2i 

=  |A22|  1 

An.2| 

(255) 


and,  similarly, 


E|  =  IE22 1  |En.2|  • 


(256) 


The  determinant  of  a  partitioned  matrix  may  be  found  in  the  Matrix  Theory 
Appendix  of  Muirhead  [24].  The  trace  of  the  matrix  product  S_1A  in  equation 
(253)  must  now  be  massaged  into  a  form  containing  the  matrix  partitions  Ai2 
and  A22  and  the  matrix  An.2.  This  requires  substantial  algebraic  manipulation, 
beginning  with  the  matrix  factorization  of  the  inverse  of  the  matrix  E, 


E-1  =  V  = 


Vn  V12 
V21  V22 


(257) 


The  following  relationships  between  the  partitioned  matrix  and  the  partitioned 
inverse  may  be  easily  verified: 


Vu  = 
S2I1  = 


vr/v12 


SnX2 

v22  -  v21vr11  Vn 

— S12E22  • 


(258) 

(259) 

(260) 


Applying  these  to  the  aforementioned  trace  results  in 

(sr'A)  = 


tr 


tr 


Au 

A2i 


=  tr 


Vn  V12 
V2i  V22 

VnAn  +  V12A21 

V2iAu  +  V22A2i 


A12 

a22 

VnAi2  +  V12A22 
V2iAi2  +  V22A22 


tr  (VuAn)  +  tr  (Vi2A2i)  +  tr  (V2iAi2)  +  tr  (V22A22) 
tr  (V 11  An)  +  2tr  (V i2 A2i)  -f  tr  (V 22 A22) 


=  tr  (V n 


An-2  +  Ai2  A22^  A2i 


]) 


nAi2A22  A2i 


+  2tr  (Vi2A2i)  +  tr  (V22A22) 

) 


=  tr  (VhAii.2)  +  tr  ^Vi 

+2tr  (Vi2A2i)  +  tr  (V22A22) 

=  tr  ^E1112An.2)  +tr  (VnAi2A221A2ij  —  2tr  (VnEi2E22  A2i) 
+tr([E221  +  V2iV1T1Vi2]  A22) 

=  tr  ^E1112An.2^  +  tr  (ViiAi2A22A2i)  —  2tr  ^VnEi2E221A2ij 


+tr  ^E22A22)  +  tr  ^A22 


E22  E2i ViiEi2E. 


22 
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tr  +  tr  (VnAi2A221A2i)  —  tr  (ViiEi2T<221A2i) 

— tr  (VnAi2E22  S2i)  +  tr  (E22  A22) 


+tr( 

'V11E 

V, 

12^22^ ^-22^22  ^2l) 

=  tr  (l 

qi^An^)  +  tr  (^22^22) 

/ 

Ai2A22A2i  —  Si2S22A 

-22 A22  A2i 

\ 

+tr 

Vn 

—  Ai2A22  A22S22 

E21 

V 

+SX2S221  A22A22^  A22 

^22  ^21 

=  tr  (l 

■'li^An^)  +  tr  (S22  A22) 

+tr( 

^11-2 

S12S221  A22  —  A12  A22 

A22'E221  ^21 

— 

=  tr  (E^An.a)  +  tr  (£22^22) 

+tr  J^Ai2  —  Si2S22  A22  A22  A12  —  S12S22  A 


H 


(261) 

Substituting  equations  (255),  (256),  and  (261)  into  the  probability  density  func¬ 
tion  of  equation  (253)  yields 

fiA22r-n, 


/(A)  = 


Vnlsu7r  etr  (“^ll^All-s)]  ['tlir  etr  (  S22lA22 
•etr  E^  JA12  —  ^12^22^ A22]  A22  A12  —  £12^22^22]  ^  J 


|Aii-2 


\m-{n—k)-k 


Tm- 


m—(n—k),k  |£ll-2 


\m—(n—k) 


etr  (-Sn^2Au.2) 


[A12  —  Si2S22A22] 

A-1 

a22 

Ai2  —  S12E22A22 

y 

(n—k),k^'m, n- 

k  |A22|-fc 

r  m,n 

|Sn.2|n-fc 

- 

L22 


im-(n-fe) 


etr  (-E^Aas) 


T'm,n—k  |^22| 

=  /  (A11.2)  /(A12IA22)  /  (A22)  ,  (262) 

where  it  is  recognized  that  An,2  and  A22  have  complex  Wishart  distributions, 

A11.2  ~  CWk  (m  -  (n  -  fc),  Sn.2)  (263) 


and 


A22  ~  CWn-k  (m,  E22) ,  (264) 

thus  proving  the  first  and  third  parts  of  the  theorem.  The  independence  of  Au,2 
from  A12  and  A22  is  seen  from  the  factorization  of  the  probability  density  function 
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of  equation  (262).  The  probability  density  function  of  A12  conditioned  on  A22 
may  be  simplified  by  considering 


r  m—(n—k)  ,fc  f1 m,n—k 

fri 


m,n 


{r(m— n+k)---T(m— n+l)T(m)"T(m— w+fc+1) 

r(m)---r(m— n+1) 

-7j.i[fc(fc-l)+(n-fc)(n-fc-l)-n(n-l)] 

5  [fc2  —  k+n2— 2nk+k2— n+k— n2+n) 

7r-fc(n-fc)_ 


(265) 


Substituting  equation  (265)  into  the  conditional  probability  density  function  of 
A12  yields 


/  (A12IA22) 


A12  —  £12X22^22 

7i- )  |A22|fc 


Sxi-2 


Ai2  —  £12^22^  A 

I  n—k 


22 


(266) 


which  is  recognized  as  the  probability  density  function  of  a  complex  Gaussian 
random  matrix, 

A12IA22  ~  CAfkx{n-k)  (£i2£221A22>  ^11-2  <8>  A22)  ,  (267) 

proving  the  second  part  of  the  theorem. 


THEOREM  A. 4 

The  following  theorem  is  an  extension  of  Theorem  3.2.11  on  page  95  of  Muirhead 
[24]  describing  the  probability  density  function  of  the  inverse  of  a  matrix  quadratic 
form  involving  the  inverse  of  a  complex  Wishart  distributed  matrix. 

Theorem  A. 4  If  A  ~  CYvn  (to,  £),  £  is  full  rank,  and  P  is  a  k-by-n 
matrix  with  rank  fc,  then 

(PA^P^)'1  ~ CWk  (m-n  +  k,  (P£-1PH)“1^)  .  (268) 

Proof:  If  £  is  full  rank,  it  may  be  factored  into 

£  =  rr^,  (269) 

where  T  is  also  of  full  rank.  Set 

B  =  r_1A(rH)"1.  (270) 

Since  T  is  constant  and  of  full  rank,  B  is  distributed  as 


B  ~  CWn  (to,  In) . 


(271) 
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Define  the  k-by-n,  rank  k  matrix 

R=P(rH)_1.  (272) 

Substituting  equations  (272)  and  (270)  into  the  matrix  product  described  in  the 
theorem  results  in 

(PA-ip#)-1  =  (Rr^A-^R^)-1 

=  ^r  |t-1a  (rH)-1j  1rh^ 

=  (RB^R^)-1.  (273) 

Similarly, 

('ps-ipH)-1  =  (Ri^s^rR*)-1 

=  r^e  (rH)_1)  rk 

=  (RR*)_\  (274) 

Using  equations  (273)  and  (274),  it  is  seen  that  the  theorem,  as  stated  in  equation 
(268),  simplifies  to  showing  that 

(RB^R*)  _1  ~CWn(m-n  +  k,  (RR*)  .  (275) 

Using  a  singular  value  decomposition,  the  matrix  R  may  be  factored  into 

R  =  U  [  Ifc  0  ]  VH,  (276) 

where  U  is  k-by-k  and  non-singular  and  V  is  n-by-n  and  orthogonal, 

v-i  _  v h  (277) 

Substituting  this  factorization  into  equations  (273)  and  (274)  results  in 

(rb-'r*)-1  =  (u[ifc  0  ]  VhB_1V  ^ 

=  (ua)_1  I*  0](V*BV)_1  )  U-1 

-  (u*)"1  ([  Ik  0  j  C_1  ^  )  U-1,  (278) 


where 


c  =  vhbv, 


(279) 
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and 


Let 


and 


( rrh )  1  =  (u  [  Ifc  0  ]  VHV 

=  [ ifc  o ]  r Ifc 

=  (uu^)-1. 


h 

0 


-1 


/ 

-1 


u 


H 


c  = 


Cn  Ci2 
C21  C22 


C_1  =  D  = 


D11  D12 
D21  D22 


(280) 

(281) 

(282) 


be  k-hy-n  —  k  partitions  of  C  and  D  =  C  1.  Then,  equation  (278)  becomes 


fRjr'n/'j  1 


(UH)_1  Cn^U-1, 


(283) 


where  Cn.2  is  as  defined  in  Theorem  A,  3  and,  as  seen  in  Muir  head  [24],  is  equal 
to  Dn1-  Since  V  is  orthogonal,  C  is  distributed  as 


C  ~  CWn  (m,  I„) .  (284) 

Applying  part  (i)  of  Theorem  A. 3,  it  is  seen  that 

Cu.2  ~  CWfc  (m  -  n  + /;,  Ifc) ,  (285) 

which,  when  applied  to  equation  (283),  results  in 

(RB-1^)-1  -  CWfc  ^m-n  +  ^UU^1) 

~  CWfc  (m-n  +  k,  (RRff) ,  (286) 

which  completes  the  proof. 


THEOREM  A. 5 

The  following  theorem  is  an  extension  of  Theorem  3.2.12  on  page  96  of  Muirhead 
[24]  describing  the  probability  density  function  of  the  ratio  of  quadratic  forms 
involving  the  inverse  of  the  scale  matrix  and  the  inverse  of  a  random  sample  of  a 
complex  Wishart  distribution.  It  is  interesting  to  note  that  the  resulting  distri¬ 
bution  does  not  depend  on  the  vector  in  the  quadratic  forms  if  it  is  independent 
of  the  complex  Wishart  distributed  matrix. 
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Theorem  A.5  If  A  ~  CW„  (m,  S)  where  m  >  n  -  1  and  if  y  is  any 
n-by-1  random  vector  independent  of  A  such  that  Pr  (y  =  0)  =  0, 
then 


ZyHA-ly 


v2 

n+1)  > 


(287) 


and  is  independent  of  y. 

Proof:  In  Theorem  A.4,  let  P  =  yH ■  Then, 

W  =  (yHA-1y)~1  ~  CWi  (m  -  n  +  1,  (y^y)"1)  ,  (288) 

which  has  probability  density  function, 


Mm-n+1-1etr(-f) 
fw{p)  =  T  (m  —  n  +  1)  \e\m~n+l 

_  _  w 


wm  ne  « 

r  (m  -  n  +  1)  0™-"+!  ’ 

(289) 

where 

0  =  (y^S-V)"1  • 

(290) 

Performing  the  transformation 

2  ytfS-iy 

Z  =  -EW  =  2yH._ 

0  yHA  xy 

(291) 

yields  the  desired  scaled  ratio  of  quadratic  forms  in  equation  (287). 
bility  density  function  of  Z  is  found  to  be 

The  proba- 

0(f)m-ne-f 

fz(z)  -  Y(m-n  +  l)29m~n+1 

^772 — 71 -|-  o 

r  (m  -  n  +  1)  2m~n+1 

2:2  Ae  2 

”  r(l)2s’ 

(292) 

where 

r  =  2  (m  —  n  +  1) , 

(293) 

which  is  a  central  Chi-squared  distribution  with  2  (m  —  n  +  1)  degrees  of  freedom. 


THEOREM  A. 6 
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The  following  theorem  is  based  on  a  result  found  in  DeVroye  [35] . 

r 

Theorem  A. 6  If  W  ~  and  V  are  independent  then 


Z  = 


w 

w  +  v 


Proof:  Consider  the  transformation 

(W,V)^(Z,Y), 

(294) 

where 

Y  =  V 

(295) 

and 

w 

(296) 

z  w  +  v' 

The  range  of  Y  and  Z  are 

o  <  y  <  oc 

(297) 

and 

0<Z  <1. 

(298) 

Inverting  the  above  one-to-one  transformation  yields 

V  =  Y 

(299) 

and 

-ZY 

(300) 

Wmz-r 

The  Jacobian  matrix  of  the  transformation,  the  partial  derivatives  of  (W,  V)  with 

respect  to  (Z,  Y),  is 

r  y  -z  -| 

J- 

(. Z-l )2  z- 1 

0  1 

(301) 

The  absolute  value  of  the  determinant  of  the  Jacobian  matrix  is 
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The  joint  probability  density  function  of  Z  and  Y  has  the  form 
Iz,y  {z,  y)  =  fv,w  (y,  -yzr- 


(303) 


If  W  ~  Xl  and  V  ~  X^,  and  are  independent,  their  joint  probability  density 
function  is 


fv,w  (v,w)  = 


f  m  1  —2L 

v  2  e  2 


t  n _ , 

«j2  e  2 


vr(m)2T;  V  I»2§ 

Substituting  equation  (304)  into  (303)  results  in 


(304) 


fz,y(z,y)  = 


m _ i 

p  2  e  2 


.  f-i  \ 

2  p2(z-l)  \ 


,r  (f )  2 


V 


re-fim 

2 


r  (S)  at 


m+n _ -i  ~~y 

y  2  1g2(l-z) 


r (f ) r (f) 2 
r  (f ) r  (l)  2“^ 


where 


and 


P 


m  +  n 


0  =  2(1  -z). 

The  probability  density  function  for  Z  is  found  by  marginalizing  over  Y , 

roo 

fz(z)  =  /  fz,Y  (z,  y)  dy 

Jy= 0 


(305) 

(306) 

(307) 


---i 

2 


---1 
2  1 


Z  2  1  (1  —  2) 

=  r  (?)  r  (l)  2^ 

(1  —  z) 

=  r  (?) r  (?)  2 
=  r(a±a) 
r  (?)  r  (?) 

where  it  is  recognized  that 

y 


roo 


yP  1e  e 

r(p)f 


dp 


7Z+m 

2 


.5-1 


22 


—  1  Tl  +  171  \  ,  ,  w  = 

r ( — —  )(2 (1  - z))  2 


(l-z)2 


(308) 


L 


00  :e  ® 


,  dy  =  1  (309) 

4=0  r  (p)  0? 

because  it  is  the  integral  of  the  probability  density  function  of  a  standard  Gamma 
random  variable  with  p  degrees  of  freedom  and  scale  6.  In  its  final  form,  equa¬ 
tion  (308)  is  recognized  as  a  standard  Beta  probability  density  function  with 
parameters  |  and  y. 


Appendix  B 


Non-Central  F  Distribution  Approximation 


As  seen  in  appendix  C,  the  non-central  Fisher’s  /  density  function  requires 
the  evaluation  of  an  infinite  summation.  As  evidenced  by  Tiku  in  [26],  there  exist 
several  methods  for  approximating  the  upper  tail  probability  of  this  distribution. 
The  Three-Moment  Approximation  is  indicated  as  the  most  reasonable  in  terms 
of  computational  requirements  and  accuracy.  The  probability  density  function 
may  be  evaluated  by  differentiating  and  negating  the  upper  tail  probability.  The 
following  outlines  this  procedure  as  applied  to  Tiku’s  Three-Moment  Approxima¬ 
tion  utilizing  Streit’s  [36]  description  of  the  upper  tail  probability  approximation. 


B.l  Upper  Tail  Probability 


Let  the  upper  tail  probability  of  a  non-central  Fisher’s  /  distribution  with  n\ 
and  n2  degrees  of  freedom  and  non-centrality  parameter  8  be  defined  by 


roc 

Q  (x\n1,n2,S)  =  f  (z\ni,n2,8)  dz. 

J  Z—X 

Tiku’s  Three  Moment  Approximation  states  that 

Q  (x\ni,  n2, 8)  «  I  , 

where  I  (y\a,  (3)  describes  the  incomplete  Gamma  function 

r  (<*  +  /?)  r  j.a—1 


1  (i|a' /J) = mm  L  t°"1  (1  ■  t)B~ 1 


and 


y  = 


-t-i 


i  + 


n2h 


(x  +  c) 


(310) 


(311) 


(312) 


(313) 
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t 


b  = 


n2  —  2  I  E 
2  \j  E- 4 


h  = 

c  = 

H  = 
K  = 
£  = 


bH 

niK  (2b  +  n2  -  2) 

n2  -  2  \  ni 

2  (/ii  +  5)^  +  (wi  +  35)  (n2  —  2)  1 
+3  (ni  -f-  5)  (tii  +  25)  (n2  —  2)  J 

(hi  +5)  +  (n2  ~  2)  (77.1  +  25) 

IP 

Ks' 


(314) 

(315) 

(316) 

(317) 

(318) 

(319) 


B.2  Probability  Density  Function 

The  non-central  Fisher’s  /  probability  density  function  may  be  described  as 

f(x\ni,n2,6)  =  ~-^Q(x\ni,n2,8) 


1  1  2  ’2 


d_ 

dx 

dy_ 

dx 


l7(*! 


n2  P 

T’ 2 


Differentiating  equations  (313)  and  (312)  yields  respectively 

-2 


dy 

dx 


\n2h/ 

by2 


1  + 


n2h 


(x  +  c) 


and 


d 


n2h 

r  (a  +  P)  _a-l 


dxI{x\(*,P)  T(a)T(p) 


x 


(1-x) 


y-\ 


Substitution  into  equation  (320)  results  in  the  approximation 


/  (x|ni,  n2, 5) 


r(=ys) 


-y 


^+1 


r  (f )  r  (|) 

where  y ,  5,  and  h  are  described  by  equations  (313)  -  (319). 


(1  -y) 


£-1 

2 


(320) 


(321) 


(322) 


(323) 


Appendix  C 


Statistical  Distributions 


For  the  convenience  of  the  reader,  a  list  of  the  probability  density  functions, 
along  with  certain  properties,  of  the  random  variables  encountered  in  this  dis¬ 
sertation  is  included  in  this  appendix.  This  material  may  be  found  in  most 
mathematical  statistics  texts  with  the  exception  of  the  chi-squared  and  Fisher’s 
/  non-central  random  variables  which  may  be  found  in  most  linear  models  texts. 
Some  specific  references  are  Manoukian  [37],  Muirhead  [24],  and  Johnson  and 
Kotz  [26]. 


Beta  X  ~ 

Beta  (a,  0)  a  >  0,  and  (3  >  0 

PDF: 

/  (*)  =  (1  -  V’1  0  <  *  <  1. 

Mean: 

E[X]  =  sfe 

Variance: 

Chi-Squared 

X  ~  where  n  is  a  positive  integer 

PDF: 

/  (x)  =  x  >  0 

j  \  t  2?r(f) 

♦ 

Mean: 

E  [X]  =  n 

Variance: 

Var  [X]  =  2  n 

A 

MGF: 

Mx  (t)  =  (1  -  2t)S  t  <  1 
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r. 


Complex  Multivariate  Normal  x  ~  CMn  (/a,  S) 

PDF:  /  (x)  = 

Mean:  E  [x]  =  fi 

Covariance:  E  (x  -  fx)  (x  —  [i)H  =  £ 

Complex  Wishart  A  ~  CWn  (m,  S) 

PDF:  /  (A)  =  ;^5«etr  (-£-1A) 

rm,„  =  r  (m)  •  •  •  r  (m  -  n  +  1) 

Mean:  E  [A]  =  mS 

Note:  If  Xj  ~  CNn  (0,  S)  and  A  =  Y$L  1  xixf  then  A  ~  CWn  (m,  S) 


Fisher’s  F  A  ~  /ni^n2  where  rii  and  n2  are  positive  integers 


PDF: 


/0*0  = 


r(=^)  nfnfx^-1 
r(W¥)  (n.+n,,)^ 


Mean:  E  [A]  = 


x  >  0 


n2>  2 


Variance:  Var  [A]  =  n^(!$^2S) 


n2>  4 


Gamma  A  ~  Gamma  (a,  /?)  where  a  >  0  and  (3  >  0 
PDF:  /  (x)  =  j^yxa-1e-/?1  x  >  0 

Mean:  E  [A]  =  § 

Variance:  Var  [A]  = 

MGF:  Mx{t)  =  (l-f)~a  t</3 


Non-Central  Chi-Squared  X 

integer 

«  e-f  (i) 

PDF:  /  (x)  =  £  —IT 

k= o  K- 

Mean:  E  [X]  =  n  +  <5 

Variance:  Var  [V]  =  2n  +  48 

MGF:  «*(«)-*£ 0 

Non-Central  F  X  ~  /nii„2  (5) 
integers 

~  e-l  (f) 

PDF:  /(*)  =  £  — ^ 

fc=0  /C- 

Mean:  E[X]  =  2fj±§ 

Variance:  Var  [V]  =  2 
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